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CYCLICAL BEHAVIOR IN PNEUMOCOCCAL GROWTH AND 
TRANSFORM ABILITY OCCASIONED BY ENVIRONMENTAL CHANGES 


By Rouiun D. Horcuxiss 


LABORATORY OF PATHOLOGY AND MICROBIOLOGY, THE ROCKEFELLER INSTITUTE FOR MEDICAL 
RESEARCH, NEW YORK 


Communicated by R. J. Dubos, December 23, 1953 


The availability of a system in which drug sensitive bacteria are transformed to 
drug resistance through the action of desoxyribonucleate (DNA) extracted from 
resistant organisms! makes possible a quantitative determination of the number of 
bacteria transformed under given experimental conditions. It is only necessary to 
determine the number of organisms which survive after suitable exposure of the 
treated culture to the drug to which it was originally sensitive. In this way a 
study has been made of the conditions which predispose an unencapsulated pneu- 
mococcal culture (R36A, derived from Type II’) to be transformed to high level 
streptomycin resistance. 

An inoculum of 103-104 cells/ml. from a fresh 8-hour culture is grown in a neo- 
peptone-meat infusion medium containing 0.2% of bovine plasma albumin and 
buffered at pH 7.5-7.8 with phosphate. After 3'/.4 hours at 37°C. the culture 
ordinarily responds to transforming factor following as little as 10 seconds exposure 
to 1-5 ug./ml. of DNA from a single high-step mutant strain resistant to over 2000 
ug. streptomycin/ml. Exposure to DNA is terminated, usually after 5 minutes, 
by addition of ca. 1 ug./ml. of crystalline pancreatic desoxyribonuclease, a technique 
first employed by McCarty, Taylor, and Avery.* When 1 to 5 ug./ml. of DNA 
acts upon a young population of 10* to 107 cells/ml., conditions are sufficiently near 
saturation so that the number of susceptible cells is the limiting factor determining 
the number of transformants produced. The latter are scored by dilution into a 
nutrient broth containing 150 ug. streptomycin/ml. and a small amount of anti-R 
phneumococcus horse or rabbit serum globulin. The antibody causes the progeny of 
each viable unencapsulated cell to cluster as a single agglutinated colony which 
slowly settles during overnight growth and can be counted on the bottom of the 
culture tube. The demonstration of the validity of this bacterial counting method 
will be described in an article in preparation, in part by showing that correspondence 
with the more conventional methods is close. Within the appropriate range, 5 to 50 
microcolonies per culture tube, inter-agglutination and overlapping of colonies are 
minor problems. During two to three years’ use, the method has proven as reliable 
and accurate as agar plate counts within the same range, and it is more convenient 
and satisfactory for pneumococcus work. Scoring of transformants is done about 
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1-1'/. hours after DNA action has been stopped, a time when they have already r 
developed drug resistance but well before their number has begun to increase I 
through multiplication. i 

After the original inoculum has grown for 3'/24 hours, a fraction of the popula- f 
tion (0.2-5%, occasionally up to 17% in current practice) will respond to trans- t 
forming factor. The fraction remains relatively constant during another 2 or more a 
hours, then declines. Quantitative work has therefore confirmed similar qualita- 
tive conclusions derived from a study of the capsular transformation system.* 

Once having been exposed briefly to a transforming agent, the susceptible cells 
are essentially irreversibly affected so that a variety of disturbances, changes in 
environment, inhibitors, etc. alter the ultimate number of transformants only to the 
degree that they modify growth of the whole culture. This stability permits the 
increments of transformable cells, which appear as the culture develops, to be esti- 
mated by comparing the yields after short and long exposures to DNA. The U 
results of such studies tend to indicate that groups of susceptible cells are succes- 
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TABLE 1 
NUMBER OF STREPTOMYCIN RESISTANT TRANSFORMANTS RECOVERED AFTER DIFFERENT SUCCES- 
SIVE ExposuRE TIMES 


ESTIMATED 





TIME OF YIELD OF 8 ESTIMATED NUMBER OF TRANSFORMANTS DERIVED FROM NEW SUS- 
EXPOSURE TRANS- ——~AN ASSUMED SUSCEPTIBLE GROUP APPEARING AT TIME peat CEPTIBLES 
TO DNA, FORMANTS 0-10 mIN., 10-20 min., 20-30 miN., 30-40 MIN., 40-50 MIN., DURING 
MIN. PER ML.® GROUP | GRouP 2 GROUP 3 GROUP 4 GRouP 5 INTERVAL 
0-10 10,000 10,000 0 0 0 0 10,000 
0-20 18,000 (10,000) 8,000 0 0 0 se 
10-20 14,000 6,000 (8,000) 0 0 0 8,000 
10-30 20,000 (6,000) (8,000) 6,000 0 0 sion 
20-30 16,000 2,000 (8,000) (6,000) 0 0 6,000 
20-40 30,000 (2,000) (8,000) (6,000) 14,000 0 sich 
30-40 17 ,000 0 0 3,000 (14,000) 0 14,000 
30-50 24,000 0 0 (3,000) (14,000) 7,000 wey 
40-50 10,000 0 0 0 3,000 (7,000) 7,000 
“ Average of three determinations; estimated error 10%. Values appearing in parentheses 
are taken from the experiment of the next preceding line. I 


sively developing and disappearing as the culture grows. This inference was first 

drawn from experiments of the type illustrated in table 1. Experiments indicate | r 
that it can be assumed that the DNA used is near a saturating concentration, and | 
that cells in which transformation is once initiated will not revert. It now becomes t 
possible to infer the number and subsequent history of these susceptible cells by r 
observation based upon overlapping time intervals. It will be noted that there is t 
evidence of new transformants whose development was initiated during the latter 


half of each 20-minute exposure. Since the total number of transformants remains f 
relatively constant, it is concluded that some of those first susceptible later cease to i 
be so. This mode of analysis of the data leads to the identification of the “groups” 8 
of cells indicated in the table. It may be'seen that these groups in all cases seem to t 
retain their susceptibility for somewhat less than 20 minutes; e.g., “group 2” 8 


appeared at some time between 10 and 20 minutes, and was no longer detected by 
30 minutes. As a consequence of their being defined as made up of susceptible 
cells which “appear” during a certain time interval, these groups are of course 
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maximal at first and then diminish with time. It seems possible that other, 
physiologically more homogeneous, groups might be defined, if the appropriate 
physiological criteria were known, and that these groups would go through a rise and 
fall more symmetrically distributed in time. In any case, such classes derived from 
the present data and assumptions would almost certainly have the same periodicity 
as those indicated in table 1. 

It should be added that longer periods of exposure to DNA give the expected 
greater proportion of transformants, provided that an effort is made to minimize 
and control the incidental effects of adding material and diluent upon the state of 
the culture. The conclusion from many experiments of the types mentioned is 
that groups or classes of organisms in the whole population are becoming trans-« 
formable, remain so for about 15 minutes and then lose this potentiality. The 
proportion which the susceptible cells will reach in a given culture is reproducible 
under given conditions, but is influenced by changes in the medium and environ- 


TABLE 2 


INFLUENCE OF INHIBITING NucLEIC Acip AND TIME OF EXPOSURE ON NUMBER OF 
TRANSFORMANTS RECOVERED 


RECOVERED 


ADDITION OF DURATION OF TRANS- TRANSFORMATIONS INITIATED DURING 
INHIBITING DNA ———= I TLPOSUEE TO-— marin FORMANTS ——EXPOSURE TO TRANSFORMING DNA-—-—~ 
RELATIVE TO INHIBITING TRANSFORMING (NUMBER/- DURING AFTER PER CENT 
TRANSFORMING DNA DNA DNA ML.) 1sT MIN. Isr MIN. INHIBITION 
no inhibitor oh 1 min. 44,000 44,000 ae 
no inhibitor ae 8 85,000 (44,000) 41,000 
no inhibitor yee 15 84,000 (44,000) 40 ,000 
no inhibitor ba 15 108,000 (44,000) 64,000 Bee 
1 min. after 14 min. 15 54,000 (44,000) 10,000 75 
simultaneously 15 15 20 ,000 10,000 (10,000) 77; (75) 
6 min. before a 1 250 250 100 
6 min. before 14 8 13,000 (250) 13,000 68 
6 min. before 21 15 15,000 (250) 15,000 63 
6 min. before 51 15 19,000 (250) 19,000 70 


“ Average of three to six determinations; estimated error +10%. Inhibiting calf thymus 
DNA, 100 ug./ml.; streptomycin transforming DNA, 1 ug./ml. Values in parentheses are taken 
from applicable experiment of a preceding line. 


mental conditions in a manner which remains for the moment somewhat un- 
predictable. What seems of interest is that such relatively small fluctuations in the 
total number of transformants as are found in the (alternating) 10-minute experi- 
ments of table 1 seem to be attributable to rapidly changing populations of suscep- 
tible cells. 

Further corroborative evidence has been obtained by taking advantage of the 
fact that a pneumococcus DNA not bearing streptomycin resistance factor, or 
indeed even calf thymus DNA in somewhat larger quantities, will compete with 
streptomycin resistance factor and interfere with transformation. As shown in 
table 2, inhibition by previously added thymus DNA is virtually complete for a 
short period (1 minute) after transforming DNA is added. Soon afterward, more 
cells become transformable, and from then on, the inhibition is only about 70%, 
corresponding to what is obtained when both DNA’s are added together. This 
experiment was not carried out under conditions suitable for determining the rate 
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of transformation during prolonged exposure to DNA, since the cultures were not 
left undisturbed (see below). What this and other similar experiments do strongly 
indicate is that groups of susceptible cells (more or less uniform in responsiveness 
to either of two kinds of DNA) do appear from time to time within the growing 
culture. 

The results just described confirm the impression that ability to react with DNA 
is a physiological property* and do not support the view that it is the result of a 
mutation’ or a terminal change undergone by certain abnormal cells. The strong 
suggestion of large groups of cells going through periodic change seemed to lead to 
the hypothesis that susceptibility might be connected with the bacterial division 
cycle, there being a certain phase of the cycle during which a limited fraction of the 
cells might be supposed to be most reactive with DNA. If this were true, any in- 
fluence which might tend to produce even a modest degree of synchrony of cell 
division cycles in the whole bacterial population, would tend to increase the heights 
and depths of the “waves” of susceptibility. These should be relatively easy to 
demonstrate accurately enough, since the number of transformants should rise from 
zero to many thousand and fall back again, while the supposed regularities in the 
cell divisions would at best be expected to give only small periodic changes in the 
slope of the growth curve. For this reason, in most of our experiments, transforma- 
bility has been the criterion for observation of possible synchronous behavior. 

It was felt that the biochemical processes of cell growth and division would be 
disrupted in systematic fashion by temporary exposure of a growing culture to a 
temperature well below that at which these systems had achieved a steady state 
equilibrium. Certain of the enzyme systems should now be less able than others to 
transfer an amount of substrate equal to that supplied them, and there would then 
be a tendency for the metabolism to be selectively slowed or even temporarily halted 
at certain points. This should give opportunity for different cells to come into 
phase with each other, with regard to the slow processes leading to cell division. 

ixperimentation along this line showed that pneumococcal cultures could be 
rather easily influenced in this manner so as to give cyclical changes of transform- 
ability. A culture grown at 37°C., brought in 1 to 2 minutes to 25°C. and kept at 
this lower temperature for 15 minutes, then subdivided into aliquots and all returned 
to 37°C., is “phased” with respect to susceptibility to transformation. The number 
of susceptible cells present at the end of cooling is high, usually well above that of a 
replicate which had remained at 37°C. The data regularly are consistent with the 
supposition that cells initially transformable at the commencement of cooling re- 
main so, and their number is usually materially augmented by cells newly become 
susceptible at the lower temperature. This state of the culture can be maintained 
at 25°C. for a period considerably longer than 15 minutes with little further gain, 
and with an eventual gradual diminution in the number of susceptible cells. In 
the investigation of a variety of times and temperatures of conditioning, the 15- 
minute period at temperatures of 22—25°C. has proved the most suitable. 

After being returned to 37°C., however, the phased culture loses within 15-20 
minutes virtually all its susceptibility to transformation, as indicated in figure 1. 
This change presumably represents the passage of the cells, accumulated in the 
susceptible state during cooling, into another state, characterized by low suscep- 
tibility. These effects of cooling and reincubation are reproducible and the drop in 
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susceptibility always occurs within 15 to 20 minutes after replacing the culture at 
the higher temperature. Such a fall may be attributed to the decay of almost any 
type of active state, superimposed upon another, rising, function. 

There are signs, however, that transformability depends upon some type of 
cyclical process, since within a few minutes, susceptible cells begin to appear again 
and reach a new level, still higher in most instances than that previously attained. 
It would be difficult to ascribe the successive rises and falls in the curves of figure | 
to any combination of metabolic-accumulation-and-decay processes except one that 
is periodic in nature. As indicated, a second period of cooling may accentuate the 
apparent synchrony at subsequent peaks. This is the case when the cooling periods 
are separated by 40-45 minutes; when 15-30 minutes apart, the second low temper- 
ature treatment may obliterate or at least modify the effect of the first, giving a flat 
or highly irregular susceptibility-time curve. 
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of culture at 25 °C. for 15 minutes, then returningitto diately following 15 minutes’ conditioning at 

37°C.. The heavy curve shows the result of onecon- 25°C. Time shown is time of reincubation 

ditioning period. Another culture (lighter line) was at 37°C. Each point is the result of 4 to 6 

given a similar treatment, and an aliquotefthe same determinations with estimated error of 

(dashed line) was submitted to a second conditioning about +10%. The indicated division 

45 minutes after the end of the first one. time and logarithmic increments are for- 
mally derived from the drawn curve and 
only approximately determined by the 
experimental data. 


All of these findings lead one to inquire to what extent these cyclical happenings 
may be related to the cellular division process. Under the conditions used here 
(complex media, late log phase, 37°C.) the division time is from 30 to 35 minutes for 
pneumococci. The accumulation at 25°C. of susceptible cells and their decay 
during the next 15 minutes at 37°C. is not inconsistent with the time relations of 
cell division. The subsequent rise, however, is not easily to be correlated with the 
average division cycle of the population, since it appears to have a period of 40 to 
60 minutes and be generally somewhat ill-defined and unpredictable. Never- 
theless, the gradual flattening of later peaks farther removed from the initial tem- 
perature conditioning suggested that some type of partially synchronous cyclical 
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behavior had been brought about which was subsequently becoming more ran- 
dom. 

It has become clear that nutritional influences, the presence of serum albumin, 
and the aeration of the culture incidental to redistribution of aliquots after condi- 
tioning, all are able to modify the rate of regain of susceptibility after it has been 
lost. These factors might obscure the supposed dependence of transformability 
ipon a 30-35 minute division cycle. It has therefore seemed wise to return to the 
‘riterion of total cell growth for evidence of the synchronous metabolic cycles pre- 
sumed to underlie the cycles of transformability. 

Growth curves of temperature-conditioned pneumococcal cultures have given 
clear indications of partly synchronous discontinuities in cell division. The cultures 
were rapidly sampled at short time intervals into cold diluent broth, five to six 
replicate dilutions being made at each point. A curve showing parts of three waves 
of division is reproduced as figure 2. It will be seen that approximate two-fold in- 
creases in cell number were observed in two successive periods of about 30 minutes. 
Since untreated cultures give in general logarithmic doubling, spread uniformly 
throughout time intervals of the same order of magnitude, it is concluded that 
“phasing” of the culture has resulted in partially synchronous division. At present 
it still remains to be demonstrated whether the viable units separating at the steeper 
portions of the curve are individual cells, pairs of sister diplococci or, perhaps, 
fragments of short chains. Preliminary microscopic observations indicate that 
chain formation is not appreciable under the cultural conditions employed. It is 
not believed that simple errors of sampling and assay could destroy the significance 


’ of division or of the similar observations made in 


of this demonstration of ‘‘waves’ 
other experiments of this type. 

It will be noted that the first burst of division is a very rapid one appearing 
during the first few minutes after the culture is returned to 37°C. Later rises 
appear to be less “synchronous,” but it should be pointed out that one-half of the 
total divisions are indicated as occurring within the last 5 minutes of each 30-minute 
cycle. 

The correlation of these growth phases with transformability is imperfect, save 
perhaps for the phase immediately following reincubation, which is accompanied by 
a rapid loss of susceptibility by the transformable fraction of the population. The 
longer periods of the waves of susceptibility may be attributable to a different 
division rate in this part of the population, or to the additional influences affecting 
this property, as already mentioned. What does receive some support from the 
present observations is the hypothesis that bacterial division and transformability 
are both functions of cyclical metabolic activity which can be to some degree 
modified and rendered more nearly synchronous by environmental influence. — If 
this can be verified, it may make possible other applications of bacteria in cyto- 
physiological investigation. 

Hegarty and Weeks® (and other authors quoted by them) have reported evidence 
of “fission waves” of bacteria in untreated cultures. It has recently been learned 
that Zeuthen’? has independently also used temperature control as the basis for 
synchronizing division in the ciliate, Tetrahymena. Lastly, it should be mentioned 
that aeration and mixing, usually involved in the temperature conditioning as 
carried out heretofore, can in themselves tend to establish cyclical behavior in 
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pneumococci. The degree to which discontinuous oxygen shock can replace or 
modify the effect of temperature as a conditioning agent is under investigation. 

Summary.—The transformability of pneumococcal cultures appears to be the 
result of cyclical processes occurring in individual groups in the population. These 
cycles, and the somewhat more regular ones of cell division to which they may be 
related, can both be accentuated and apparently rendered more synchronous, by 
the conditioning effect of appropriate environmental change. 

1 Hotchkiss, R. D., Cold Spring Harbor Symp. Quant. Biol., 16, 457-460 (1951). 

2 Avery, O. T., MacLeod, C. M., and McCarty, M., J. Exp. Med., 79, 137-157 (1944). 

’ McCarty, M., Taylor, H. E., and Avery, O. T., Cold Spring Harbor Symp. Quant. Biol., 11, 
177-183 (1946). 

‘ Alexander, H,. E., and Leidy, G., J. Exp. Med., 93, 345-359 (1951). 

5 Alexander, H. E., discussion under Zinder, N., Cold Spring Harbor Symp. Quant. Biol., 18, in 
press. 

6 Hegarty, C. P., and Weeks, O. B., J. Bact. 39, 475-484 (1940). 

7 Scherbaum, O., and Zeuthen, E., Exp. Cell Research, in press, 1954. 


DOUBLE MARKER TRANSFORMATIONS AS EVIDENCE OF LINKED 
FACTORS IN DESOXYRIBONUCLEATE TRANSFORMING AGENTS 


By Rouu D. Horcukiss anp J. MARMUR* 
LABORATORY OF PATHOLOGY AND MICROBIOLOGY, THE ROCKEFELLER INSTITUTE FOR MEDICAL 
RESEARCH, NEW YORK 


Communicated by R. J. Dubos, December 23, 1953 


The ability of certain bacterial desoxyribonucleic acid (DNA) preparations to 
transfer heritable characters of a donor strain over to a related strain of bacteria! has 
suggested the implication that chromosomal DNA may play the major role in 
genetic determination in the higher organisms. It has not been clear, however, 
whether the “transforming agents’ should be considered the analogs of the genes, 
of whole chromosomes, or of some other level of organization. That they do not 
convey the total genetic complement of a donor strain to a recipient cell was already 
suggested by Langvad-Nielsen, who showed with the early techniques of in vivo 
transformation that capsular transformants of pneumococcus retained the same 
sulfonamide resistance as the recipient strain, acquiring, so far as was known, only 
the capsular property from the donor strain.? It has more recently been shown 
that drug resistance can indeed be transferred through DNA from a donor strain, 
but that the resistance factor and the capsular property are not in general both 
demonstrable in one and the same transformed cell. This is true both for strepto- 
mycin resistance and several agents conferring different quantitative levels of 
penicillin resistance in Diplococcus pneumoniae.* It was concluded that individual 
DNA particles had carried individual unit characters into the recipient cell. These 
results and others indicated close correspondence between the unit tranforming 
principle and the bacterial gene.‘ 

It was estimated that the rare doubly-transformed (encapsulated and penicillin 
resistant) cell appeared in a proportion approximately in the range of the product 
of the frequencies of the two individual transformations within the population. 
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TABLE 
Recovery OF MULTIPLE TRANSFORMANTS FROM PNEUMOCOCCI TREATED WITH VARIOUS TRANSFORMING AGENTS 


EXPECTED DOUBLE TRANSFORMATIONS 


OBSERVED TRANSFORMATION TO: 


FOUND 


PERCENTAGE 








EXPECTED 


EXPECTED 
0.0061 
0.01 
0.036 
0.06 


ROUTE 
Mxs8S 
Mx<S 
Mx<S&8 
MxS8S 
MxsS 
Mx<S 
Mx<sS 


« 


PROPERTIES, 


MS: 





EXPOSURE TO AGENT 


DN Ams 


EXPT 


14 
10 


0.091 


o.17 


32 
0.40 


0.6 


0 


M: 


2.0 


SIE 


5 min. 


10 min. 


d 


DN Ams 


0.16 


é 


15 min. 


30 min. 


DN Ams 
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é 


0.40 
0.00 


MS: 


2.6 


M: 


6 


2 


DN Ams 


0.2 
0.29 


0.012 
0.02 
0.10 


0.16 


3 


MS: e 


7 


0.8 
0.8 


M: 


5 min. 


10 min. 


DNAm + 


3 


‘ 
« 


0.0066 
0.41 
0.22 
0.0058 


MS: 
0.0071 


3 


M: 


2.8 
4.0 
4.0 


S: 


DNA 


DN Arsmp 


MS: 


SF: 


6 


M: 


10 min. 


da 


SXF 


MX F 


0.06 


0.10 


MF: 


0.0041 
0.0057 
0.016 


MXSXF 
SF X M 


MS X F 


MSF: 
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0.44 
31 


3 


« 
e 


0.0002 
0.04 


MF X8 


3 


€ 
e 


MxXS 


10 


0 


MS: 
SP: 


M: 


nN 


10 min 


DN Arsup 


4h 


0.4 


0.018 


a 


7 


0.00 


0.67 


P: 


Proc. N. A. S. 


This correspondence indicated that 
the double transformations could 
have resulted simply from succes- 
sive transformation events brought 
about by two independent single 
factor agents. Thus, if approxi- 
mately 1% of the cell population 
responds to either agent, one may 
expect 1% of 1%, that is, 0.01% or 
1 per 10,000 to have had the oppor- 
tunity to react with both. 

The development in this labora- 
tory of procedures for dealing with 
several new transformable charac- 
ters of Diplococcus pneumoniae has 
permitted a more adequate search 
for pairs of characters which might 
not behave independently in a 
transformation process. The char- 
acters used in this survey have in- 
cluded penicillin resistance (first 
step, ““P’’), streptomycin resistance 
(high step, ‘S’”’), and two not pre- 
viously described; sulfanilamide 
resistance (“F’”’) and mannitol 
utilization (“M’’, the ability to 
grow and metabolize mannitol in 
media in which this sugar alcohol 
is the limiting energy source). 
Cells bearing this last property are 
quantitatively determined in a de- 
fined medium in which a mixture 
of pure amino acids’ approximating 
the composition of casein hydroly- 
sate with a small quantity (0.005%) 
of peptone (Tryptone, Difco) di- 
alysate are supplemented by the 
vitamin and mineral mixtures de- 
scribed by Adams and Roe.’ With 
the addition of mannitol (0.3%) 
and sodium pyruvate (about 
0.01%) this medium fosters selec- 
tive growth of mannitol mutants 
The drug. re- 
sistant strains are scored and. iso- 


or transformants. 


lated in neopeptone-meat infusion 
broth made selective by addition 
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of either 0.05 unit/ml. crystalline benzylpenicillin, or 150 yg./ml. streptomycin, or 
in casein hydrolysate peptone media containing 15 yg./ml. of sulfanilamide. Com- 
binations of the drugs in these media allowed scoring for two properties. To all of 
the media were also added small amounts of anti-pneumococcal antibody globulin, 
which serve to isolate the growth in discrete colonies. The counting of colonies 
arising from appropriate dilutions in such media is the basis for the scoring 
method to which reference has already been made.’ 

It is found that two properties, mannitol utilization, M, and streptomycin resist- 
ance, S, are frequently both introduced into the same transformed cells during a 
short exposure to DNA preparations (1-5 yug./ml.) from donor strains bearing both 
traits. As illustrated in table 1, experiments | and 2, the number of MS trans- 
formants is commonly as great as one-fourth the number of M cells produced, al- 
though it is usually smaller in proportion to the 5 transformants — In these experi- 
ments, the number found is from 5 to 15 times the number of double transformants 
that would be “expected” on the assumption that they arose by successive or 
simultaneous random (S X M) transformations. These findings are similar to those 
from many other experiments and are illustrated again in the multiple factor trans- 
formations of 4a and 46, table 1. Here, values in the last column near to, or less 
than, unity indicate those routes by which (in any sequence) the multiple trans- 
formants could with reasonable probability have arisen. It will be seen that the 
assumption of random independent transformations is satisfactory for most pairs of 
characters except M and 8; even in the case of the rare triple transformant, MSF, 
the most acceptable hypothesis is that M and S were somehow acquired non- 
randomly, and the resultant MS cells were independently transformed with respect 
to the F property. 

The M and 8 characters have no apparent obligatory connection in the metab- 
olism (phenome) of pneumococci, each property having appeared independently 
of the other a number of times in spontaneous or induced mutations, or through 
transformation. Therefore, it becomes appropriate to ask whether the non- 
randomness of the MS transformations is due to non-randomness in the responsive- 
hess to the two transforming factors within susceptible bacterial populations, or 
rather to some structural relationship linking the two DNA transforming factors. 
A mixture of transforming factors, DN Ay and DN Ag, from separate M and 8S strains, 
should not be able to induce double transformations to MS if the latter principle is 
operating. Experiment 3 of table | illustrates that mixed DNA’s gave only a few 
MS transformants, well below the number that independent transformations could 
have provided. 

These calculations disregard the considerations indicated elsewhere’ that some of 
the cells in a culture are more predisposed to transformation than others, but they 
are probably not seriously unreliable as long as those cells potentially capable of 
responding, biologically, to a transforming agent are a relatively large fraction of 
the population. Actually the ability of one DNA factor to prevent the utilization 
of another by the same cell,” seems to be the cause of larger deviations, in the 
opposite direction. Multiple transformations believed to be unlinked occur less 
frequently than random, as indicated by factors in Table | varying from 0.06 to 
0.44, 

The results therefore indicated strongly that DNA derived from an MS strain 
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confers the M and 8 transforming agents together in some kind of association not 
found in a mixture of M and 8 transforming agents derived from separate cells. 
The possibility was considered that the association might be, e.g., an aggregation 
which the separate agents underwent during reprecipitation, etc., as the DNA prep- 
aration is made. Accordingly, a mixed culture of singly marked M and 8 cells 
(ratio M/S=1.3) was grown, and DNA prepared from it by the usual procedure. 
This preparation behaved like the artificial mixture of separately isolated.M and 8S 
DNA’s, giving virtually no MS transformations (e.g., when recovery of ca. 100 
cells/ml. would have indicated non-random transformation, none were recovered). 

It appears, therefore, that the special relationship of the S to the M transforming 
factor (but not to certain others) is the result of a special relation existing within 
the donor MS cell. It is natural to suppose that the association may be the equiv- 
alent of genetic linkage. The incompleteness of the linkage would have an obvious 
parallel in “crossing-over” which separates, to varying degrees, genetically linked 
factors in the higher forms. It might also be due to heterogeneity among the DNA 
particles derived from the MS strain. The manifestation of such phenomena by a 
preparation of DNA raises profound questions which cannot be answered as yet, 
concerning the relation of these properties of DNA to those of the chromosomal 
linkage groups in higher species. In particular, if one now is inclined to ask whether 
chromosomes also owe their linkage patterns to DNA structures rather than to 
connected bands of nucleoproteins, one is correspondingly obliged to question 
whether bacterial DN A’s showing linked properties can be prepared as free of pro- 
tein as the capsular transforming preparation, which contained none, or at most less 
than 0.02%.’ There is no indication that they do contain protein; this question 
will be further investigated. Linked transformations have, in any case, been 
brought about by DNA preparations resistant to the autolytic proteolysis and 
alcohol precipitation, and sustaining the removal of protein by surface denaturation, 
which occur in the usual procedure of purification. Furthermore, all of their 
transforming activities are lost in brief treatment with crystalline desoxyribonu- 
clease. T 

It is found that the same type and degree of linkage of M and 8 factors is exhibited 
by DNA from two MS strains, one of which had acquired its S property through a 
discrete spontaneous mutation (expt. 2, table 1) and the other of which was an M 
strain transformed by DNA from an 8 strain (expt. 1). Thus, the site taken up by 
the S determinant within the cell bears the same relation to the M factor when it 
arrives by mutation and when it is introduced by tranformation. This fact 
appears to give substance to the concept of a unique genetic “locus” in bacteria. 
The locus of the S factor may be supposed to be within the same DN A particle which 
bears the M factor. 

Further support, for the picture outlined above, is given by experiments on the 
transformations to drug sensitivity. Drug-sensitive bacteria, arising from a resist- 
ant population in small numbers, cannot satisfactorily be detected by a differential 
survival count. However, if streptomycin-sensitive organisms were produced in 
large proportion among the cells simultaneously acquiring the mannitol-utilization 
trait, they could easily be identified by examining an appropriate number of single- 
colony strains of the M transformants selected from the untransformed population 
by virtue of their M property. As indicated in table 2, the action of DNA from an 
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ordinary M (streptomycin-sensitive, ““SS—’’) strain sometimes confers, along with 
the M property, streptomycin sensitivity, displacing or inactivating the S deter- 
minant of resistance already present. Analogous results are recorded for the action 
of S factor in displacing M from an M strain. These findings again demonstrate 
that there is a unique locus within the bacterial cell capable of determining meta- 
bolie functions leading either to streptomycin resistance or to sensitivity. They 
also re-emphasize that this site and a corresponding one determining mannitol 
utilization stand in special relationship to each other within or upon the cell DNA 
particles. 

Finally, it may be mentioned that another pair of pneumococcal characters, 
S and F, seem to a lesser extent also to have given linked transformations, as in- 
dicated both in table 2 and in experiment 4, table 1. 


TABLE 2 


LINKED TRANSFORMATIONS INVOLVING Loss OF A SELECTIVE MUTANT PROPERTY 


TRANS- SINGLE COLONY STRAINS TESTED 
——-CHARACTERS PRESENT FORMANTS NUMBER OF DOUBLE 
DONOR RECIPIENT EXPT. SELECTED NUMBER OF TRANSFORMANTS 
DNA CELLS NO. FOR STRAINS FOUND 
M(S—) S(M —) l M 19 4 M(S—) 
2 M 9 1 M(S—) 
3 M é none 
4 M 10 1 M(S—) 
5 M 4 1 M(S—) 
S(M—) M(S—) 6 Ss 18 7 S(M-—) 
7 S 24 1 S(M—) 
8 s 30 4S(M—) 
MP(S—) M(S—) 9 P 10 (none M—) 
S(M—) MP 10 S 4 2 SP(M—) 
M(F—P-) PF(M—) 11 M 3 | PM(F—) 
S(M—F-—P-—) PFM(S—) 12 S 4 1 PMS(F—) 


Summary.—Several pairs of transforming agents give randomly distributed 
independent transformations in pneumococci. In contrast, desoxyribonucleate 
derived from a strain bearing the two mutant properties, mannitol utilization and 
streptomycin resistance, gives a high proportion of double transformations to both 
of these selective properties. The results suggest the material existence in the 
bacteria of specific genetic loci, involving these determinants in some mode of linked 
association within the desoxyribonucleate particles of the doubly marked strains. 
These sites may also carry factors associated with streptomycin sensitivity or 
incapacity to utilize mannitol, similarly linked, and capable of displacing the 
selective mutant property from a cell already bearing it. 

* Public Health Research Fellow, 1952-53. 

+ Of no little interest in itself is the fact that mannitol utilization, an adaptive system in the M 
strains of pneumococcus, is conferred equally well by DNA of these strains, whether they are in 
the adapted or the unadapted state. 

1 Avery, O. T., MacLeod, C. M., and McCarty, M., J. Exp. Med., 79, 137-157 (1944). 

* Langvad-Nielsen, A., Acta Path. Microbiol. Scand., 21, 362-369 (1944). 

’ Hotchkiss, R. D., Cold Spring Harbor Symp. Quant. Biol., 16, 457-460 (1951). 

‘ Hotchkiss, R. D., Exp. Cell Research, suppl. 2, 383-390 (1952). 

®> Rockenmacher, M., James, H. A., and Elberg, S. S., J. Bact., 63, 785-794 (1952). 

®* Adams, M. H., and Roe, A. S., /bid., 49, 401-409 (1945). 
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8 Hotchkiss, R. D., in Phosphorus Metabolism, edited by W. D. McElroy and Bentley Glass, 
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THE ISOLATION AND IDENTIFICATION OF THE MAJOR ANION 
FRACTION OF THE AXOPLASM OF SQUID GIANT NERVE FIBERS* 


By Bernarp A. KOECHLIN 
BIOLOGY DEPARTMENT, MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
Communicated by Francis O. Schmitt, December 21, 1953 


It. has been known for some time that a large fraction of the anions in the axo- 
plasm of squid giant nerve fibers and in lobster nerves are organic in nature.'~® 
It could be deduced that the organic ions have an equivalent weight of about 100.* 
It was also demonstrated that lobster nerve contains high concentrations of free 
amino acids, particularly aspartic acid and alanine.’ However, even after this 
discovery the anion deficit remained considerable; some 60 per cent of the anions 
remained to be identified. From analyses of the free acidic amino acid content of 
invertebrate nerves, particularly those of Carcinus, Lewis* believed to have demon- 
strated acid-base balance to within about 10 per cent. These analyses were made 
on intact nerves rather than on extruded axoplasm. Therefore, acid-base distribu- 
tion between intracellular and extracellular phases could only be inferred. The 
data on electrolyte balance were thoroughly reviewed by Hodgkin.’ 

Because of the possible significance of the axonic anions in the biochemical proc- 
esses and bioelectric properties of the nerve fiber, efforts were made to determine 
the identity of the missing organic anion. It is believed that this has now been 
accomplished in the case of squid axoplasm. In this paper are reported the chief 
results of the analyses and identifications; the details will be reported subsequently. 

Method of Isolation of the Anion.—Axoplasm from squid giant fibers was prepared 
in the manner described by Maxfield.’ Individual samples consisted of aqueous 
suspensions of the pooled axoplasm from the nerves of 20 to 60 squid. The amount 
of axoplasm in each preparation was determined from dry weights taken for each 
pool and from our corrected value for the water content of extruded axoplasm (86.5 
per cent). For the isolation of the free anions either the aqueous suspension of 
axoplasm was dialyzed against distilled water at 0°C. or precipitated and extracted 
with 60 per cent ethanol at about —10°C. Both methods led to satisfactory ex- 
traction and essentially identical results. Isolation of the unidentified anion in 
practically quantitative yields was achieved by subjecting the extract to a sequence 
of ion exchange column operations as follows: passage through Dowex-50, partly 
in its silver form, removed cations and dipolar molecules and retained the insoluble 
silver salts of chloride and phosphates; passage through Amberlite IR-457 then 
removed the anions, allowing uncharged molecules to pass through; after elution of 
the anions from the Amberlite with base, the salts thus formed were passed through 
Dowex-50, partly in its mercury form, retaining the dicarboxylic acids and yielding 
the free remaining acid. In a typical isolation about 15 to 20 mg. of this acid, in 
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practically pure form, were obtained from the axoplasm of the nerves of about 100 


squid. 

Identification of the Anion.—The hygroscopic free acid which crystallized with 
considerable difficulty has an equivalent weight of 135 + 10, as determined on the 
barium salt. No measurable optical activity was observed. ‘Titration curves indi- 
cated a strong acidic group with a pK close to 2.0; no secondary inflection at higher 
pH, corresponding to a multivalent acid, was observed. That the acid is not car- 
boxylic was suggested by the fact that no ketone, ester, or lactone group, or un- 
saturation could be demonstrated, such as might confer strong acidity upon a car- 
boxylie acid. 

The clue to the identification of the anion came with the demonstration of a 
sulfonate group. The axoplasmic anion, on digestion with nitric acid, consistently 
liberated one mole of sulfuric acid per equivalent of acid. Since this did not occur 


TABLE 1 
Actp-BasE BALANCE IN Squip NERVE AXOPLASM 
CONCENTRATION 
M EQUIV. PER G. 
SUBSTANCE OF AXOPLASM ANALYTICAL METHOD USED 
Anions 
Chloride 150 + 20 Conway? 
Phosphates 35+ 3 Lowry and Lopez!® 
Aspartic acid 65+ 3 
Glutamic acid 10+ 2 Moore and Stein! 
Fumaric acid 2+ 5 Martins and Nitz-Litzow!? 
Succinic acid 
Isethionic acid* 230 + 20 Cuthbertson and Tomsett!? 
Total anions 500 + 20 
Total base 520 + 20 Ion exchange titration! 


* Sulfate liberated from anionic sulfonate. !* 


as a result of acid or alkaline hydrolysis the material could not be a sulfate ester. 
The presence of one primary hydroxy] group attached to a second carbon atom was 
indicated from the results on oxidation with chromic acid: consistently, two 
equivalents of oxygen were consumed per equivalent of anion under formation of 
divalent carboxylic sulfonic acid demonstrated titrimetrically. No ketone, alde- 
hyde, or sulfuric acid resulted. The presence of sulfonate and hydroxy! groups 
was confirmed by infra-red spectrographic analysis. 

These facts, together with the equivalent weight figure, strongly suggested that 
the substance is 2-hydroxyethane sulfonic acid (or perhaps its next higher analog). 
Evidence confirming this conclusion was obtained by demonstrating, by infra-red 
spectrography, the identity of the anion with a sample of 2-hydroxyethane sulfonic 
acid (isethionic acid) prepared by deamination of taurine. Both samples were 
prepared as the methy! esters. All the bands observed in the spectrum of the anion 
sample were present also in the spectrum of the synthetic compound. 

Acid-Base Balance.—From table 1 it will be seen that the amount of isethionic 
acid present in axoplasm is just sufficient, when added to the other anions demon- 
strated to be present, to balance the base. Detailed documentation of these ex- 
periments will be provided elsewhere. 
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Experiments designed to determine whether the predominant anion of lobster 
and other nerves is also isethionic acid are in progress as well as investigations of 
the possible role of this substance in the biochemical and physiological processes in 
nerve. 
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DIELECTRIC INCREMENTS IN AQUEOUS SOLUTIONS OF SYNTHETIC 
POLY ELECTROLY TES* 


By Howarp M. Dinrazis, J. L. ONcLEY, AND RayMonp M. Fuoss 


UNIVERSITY LABORATORY OF PHYSICAL CHEMISTRY RELATED TO MEDICINE AND PUBLIC HEALTH, 
HARVARD UNIVERSITY; AND STERLING CHEMISTRY LABORATORY, YALE UNIVERSITY 


Presented before the Academy, April 28, 1953; Communicated December 28, 1953 


Dielectric properties in aqueous solution have been reported for two general 
classes of charged macromolecules. The first of these classes, that of the globular 
proteins, has been studied in some detail; the experimental results have been con- 
sistent with reasonable assumptions concerning the size and shape of the molecules 
and the distribution of ionized groups." 2» Measurements on the second class, that 
of extended polyelectrolytes, have been limited until very recently to the nucleic 
acids and similar complex biological materials of relatively unknown structure.* * ° 
The electrical behavior of solutions of these two classes of macromolecules exhibits 
striking differences. 

Some conclusions concerning the size and shape of the nucleic acids have been 
deduced from dielectric measurements, using a model which represents nucleic acid 
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molecules as rigid rotating dipoles, although there is little independent evidence 
supporting such an assumption. It is therefore of interest to study the dielectric 
properties of polyelectrolytes of known molecular structure and charge distribution 
in an effort to correlate macroscopic and molecular parameters. Studies of this 
type have been reported recently for a synthetic polyelectrolyte® and for carboxy- 
methylcellulose.’ 

The poly-4-vinyl-N-n-butylpyridinium bromide used in our work was prepared 
from the middle fraction of a sample of poly-4-vinylpyridine whose reduced vis- 
cosity (nsp/C) at 0.200 g./100 ml. in 95% ethanol was 6.18. We estimate from this 
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FIGURE 1 
Dependence of dielectric constant of aqueous solutions of poly-4-vinyl-N-n-butylpyri- 


dinium bromide on concentration and frequency: 1, g = 0.0010; 2,0.0049; 3, 0.0162; 
4,0.050; 5, 0.40, and 6, 2.40 g./I. 


an approximate molecular weight of 2 X 10° Quaternization was effected by 
treating the polymer with n-butyl bromide in tetramethylenesulfone; the product 
contained 25.9% Br, which corresponds to 60.7% quaternization. The salt was 
slightly acid, potentiometric titration indicating that approximately 1% of the 
nitrogen was quaternized by hydrogen bromide. The salt was dried by lyophiliza- 
tion from water, followed by storage in a desiccator over solid potassium hydroxide. 
A stock solution was prepared by dissolving a weighed quantity of material in con- 
ductivity water made by resin deionization of distilled water. Dilute solutions 
were prepared by weight dilution of the stock solution. For the most dilute solu- 
tions, ali operations were carried out in the absence of carbon dioxide in order to 
eliminate the effects of extraneous electrolyte. 

Electrical measurements were made by the bridge method over the frequency 


range from | kilocyele to 4 megacycles. The type of apparatus used has been 
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described previously.* ° In order to minimize polarization and other errors result- 
ing from the wide frequency and conductivity range involved, calculations were 
based on the difference in behavior between the polyelectrolyte solution and a 
reference solution of potassium bromide of approximately the same conductance. 
Electrode polarization corrections were made at low frequencies by the empirical 
method of Oncley."” 

The dielectric increment Ae’ was measured over the concentration range, 0.0010- 
2.40 grams per liter; concentrations in these units are designated by the symbol g. 
The results are summarized in figure 1, where Ae’ is plotted against the logarithm of 
frequency for the six concentrations measured. The solid curves represent the 
original observations, while the dotted curves are corrected for polarization. It 
will be seen that the effect of electrode polarization capacity was small for fre- 
quencies above 10° cycles per second. The polarization, however, becomes a large 
fraction of the total capacity increment at frequencies below 10+ cycles per second; 
it is therefore necessary to regard the increments in the region of 10° cycles per 
second as tentative values on account of uncertainties in the correction. 


TABLE 1 


DEPENDENCE OF Di ELECTRIC INCREMENT ON FREQUENCY AND CONCENTRATION AT 25° C, 





_— — ——_—_____—— Ae’ /q- ee —E 
g 10% f = 103 104 108 106 4X 106 

0.0010 0.6 2200 1000 100 
0.0049 1.4 1400 980 310 oe Ae 
0.0162 3.4 1050 560 270 20 6 
0.0480 11.3 vies 140 37 6 
0.0500 8.9 820 280 135 32 6 
0.400 64 Pe 25 la 6 
2.40 323 ar: 5.6 3.5 2.7 
0.048" 5.6 290 200 100 15 3 


* Measured at 0.0°C. 


Values of the dielectric increment per gram per liter, Ae’/g, are given in table | 
for various frequencies and concentrations; conductances at one kilocycle are given 
(ohm~!em.~!) under the heading « in the second column. It will be noted imme- 
diately that the dielectric increment per gram for the polybromide covers an enor- 
mous range of values. The electrical behavior of the synthetic polyelectrolyte stands 
in marked contrast to that of amino-acids, peptides, and proteins in two distinct 
respects. First, for the latter group of naturally occurring substances,' Ae’/g has 
a maximum value of about 2, whereas the synthetic polyelectrolyte solutions show a 
value of Ae’ /g of several thousand at low frequencies and concentrations. Secondly, 
the quantity Ae’/g is almost independent of concentration in the case of amino- 
acids, peptides, and proteins, whereas in the case of the polyelectrolyte, a large 
concentration dependence is evident from the data in table 1. Natural polyelec- 
trolytes have been reported to have relatively large values of Ae’/g, some typical 
values being 200 for sodium thymonucleate,‘ 80 for thymonucleohistone,> and 120 
for sodium carboxymethylcellulose.’? These materials also show a strong depend- 
ence of Ae’/g upon concentration at constant frequency. 

It was possible to measure the changes in conductance (Ax) with frequency for 
most of the solutions. The values for the more concentrated solutions are shown in 











Vi 


fis 
sO 
th 


in 
an 
is 


TI 


Wo 
Ae 
de: 
cul 
in 

pre 








Vou. 40, 1954 CHEMISTRY: DINTZIS, ET AL. 65 


figure 2. This type of behavior is consistent with a purely a.-c. dielectric loss in the 
solution. The relationship to changes in dielectric increment can best be seen in 
the Cole plots of figure 3, where values of Ae” are calculated from the conductance 
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FIGURE 2 


Increment of conductance of polyelectrolyte solutions as a funce- 
tion of concentration and frequency: 3, g = 0.0162; 4’, 0.048; 
5, 0.40; 6, 2.40 g./I. 


increment from the equation Ae” = 1.80 X 10'* Ax/f. The curves for g = 0.048 
and 0.40 show that the in-phase component of the complex dielectric constant, Ae”, 
is due to the same mechanism as that which produces the dielectric increment Ae’. 
These two curves approximate a circular are with center below the Ae’-axis; in other 


TABLE 2 


Errect or ADDED PorassiIUM BROMIDE AT 25°C. 


: Ae’— _— 
108g 10° [Br] r f = 103 104 105 
1.90 1.59 0.0 7 4.8 1.6 
4.75 1.54 0.8 6 3.0 1.1 
4.60 1.49 1.6 5 2.2 0.8 
4.48 1.45 2.4 i) 2.3 0.7 
4.35 1.41 10.0 3 1.1 0.5 


words, the maximum in Ae” is less than half the estimated low frequency limit of 
Ae’. We therefore conclude that a distribution of relaxation times is necessary to 
describe the dielectric dispersion. The sharp rise toward lower frequencies for the 
curve at g = 2.40 is possibly due to polarization, for which no correction was made 
in this plot. The high frequency end of this curve, however, appears to be ap- 
proaching normal behavior. 
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Conductance measurements at high field strength'! on solutions of another sample 
of this polysalt show a dispersion of the Wien effect with a time constant of the order 
of microseconds. This observation is independent confirmation of the existence 
of a dispersion of dielectric constant and conductance with frequency, and also of 
the order of magnitude of the relaxation time deduced from the inflexion points of 
figure 1. 

The addition of simple salt (potassium bromide) to the polyelectrolyte solution 
produced a marked decrease of dielectric increment. The data are given in table 2, 
where g denotes polyelectrolyte concentration in g./1000 ml., [Br] bromide ion con- 
centration originally present as gegen ion in the polyelectrolyte in equivalents per 
liter, and r is the ratio [KBr]/[{Br]. A similar effect has been observed for sodium 
thymonucleate solutions.¢| This behavior is in sharp contrast to that of amino- 
acids, peptides, and proteins, which are little influenced by the presence of elec- 
trolyte, but is analogous to the suppression of other polyelectrolyte characteristics 
(such as viscosity'? and streaming birefringence'*) by the addition of simple elec- 
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FIGURE 3 


Cole plots from data of figs. 1 and 2: open circles, g = 2.40; left 
black, g = 0.40; right black, g = 9.048. 


The effect of temperature upon the dielectric increment and conductance was 
measured at one concentration, g = 0.048. Measurement at 0°C. and 25°C. (table 
1) showed that the pattern of the frequency dependence of dielectric increment was 
the same at both temperatures but the dispersion curve was shifted by a factor of 
0.40 toward lower frequencies at the lower temperature. At least part of this shift 
can be ascribed to viscosity changes, but no data for comparison are available on the 
viscosity of polyelectrolytes as a function of temperature. The Cole plot for the 
data at 0°C. coincided with that for the 25°C. data. 

Since our data do not permit evaluation of the low frequency limit for Ae’, it 
was not possible to obtain accurate relaxation times. It may be seen from figure 1, 
however, that the region of dielectric dispersion covers too wide a frequency range 
to be explained in terms of a single relaxation time. The dielectric dispersion 
shifted toward higher frequencies as the concentration was increased, in agreement 
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with the behavior of the natural polyelectrolytes, but directly opposite to the be- 
havior of the proteins. 

The similarity in all respects between the dielectric behavior of our synthetic 
polyelectrolyte and that of the natural linear polyelectrolytes suggests that the same 
molecular mechanisms are responsible for most of the dielectric behavior. Because 
of the flexibility of the molecular structure of the linear polymer and the purely 
cationic nature of poly-4-vinyl-N-n-butylpyridinium bromide, it seems extremely 
unlikely that there could exist a permanent dipole moment sufficiently large to 
account for the observed effects. A fluctuation mechanism of the specific type 
proposed by Kirkwood and Shumaker for proteins, which postulates a non-vanish- 
ing mean square moment due to the exchange of protons among equivalent basic 
sites, would not seem to apply here because the charges are fixed to the pyridinium 
nitrogens. 

The polyelectrolyte molecule may be regarded as a flexible, highly charged thread 
of great length surrounded by a relatively diffuse cloud of bromide ions, the spatial 
distribution of bromide ions being determined by the electrostatic potential. At 
very low concentrations the molecules will tend to stretch out and assume a rod-like 
shape because of intramolecular electrostatic repulsion between neighboring seg- 
ments of the chain. On account of the electrostatic attraction of bromide ions to 
the chain, the latter will tend to cluster near it, resulting in high concentration of 
mobile charge close to the macromolecule and low concentration in regions rela- 
tively faraway. A detailed analysis of the potential of an infinite rod-like molecule 
and the distribution of the counter ions has been presented.'* If the electrostatic 
screening due to small ions is increased by increasing the polymer concentration or 
by the addition of simple salts, the intersegmental repulsion along the chain will 
decrease and Brownian motion will lead to the contraction of the molecule into a 
random coil. 

In view of these considerations, the dilute aqueous solution of polyelectrolyte 
may be regarded as a heterogeneous medium, composed of regions of relatively high 
conductivity in the vicinity of the macromolecule, suspended in a continuous me- 
dium of considerably lower conductivity. The size and shape of the regions of high 
conductivity and the dependence of conductivity upon distance will be related in a 
very complicated manner to factors such as concentration, chemical constitution, 
molecular weight, charge density, and the presence of simple salts. We shall now 
show that such a model can lead to a dielectric dispersion which agrees in both 
magnitude and frequency with that observed experimentally. In addition, the 
dependence of electrical behavior on the concentration of polymer and on that of 
added simple salts may be qualitatively interpreted. 

The electrical properties of a polyphase dielectric consisting of a continuous 
medium of dielectric constant «’ and conductivity o: in which are suspended small 
spheres of a second phase of dielectric constant «’ and conductivity o2 was solved 
by Maxwell " and generalized by Wagner" (here, « denotes conductivity in e. s. u.). 
It was shown" that if the suspended spheres occupy a volume fraction, p<1, the 
bulk dielectric constant is given by the equation 


€ = «(Ze + € — 2ple — e)|/[Ze + e& + pla — e&)]. 


Here, both « and « are complex quantities of the form « = «’ — je” and e = 
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/ 


6 — je”. This relationship can be put into the following normal form by sepa- 
rating real and imaginary components: 


_ (1 ” k Jkwr ) , 
ee: 1 + wr? L + w?r? ) 


‘ 3p(e’ — €’) 
€. = ¢ 1 —- 
Je! + &’ 


k = 9p( a €2" — o2€')?/e' (Ze, oo €.') (20, + go)” (2) 


r = (2e’ + &’)/4r(20, + a2). (3) 


where 


If, instead of a dilute suspension of spheres, one takes as model a two-layered 
dielectric composed of sheets of two electrically different materials with the same 
thickness, the form of equation (1) is unchanged. The values of k and 7 in equa- 
tions (2) and (3) are changed only numerically by this drastic change in geometry. 
Experimental confirmation of the relative insensitivity of the Maxwell-Wagner 
dispersion to the geometric distribution of phases has been given by studies on a 
model system composed of quartz and alcohol.'’ The essential prerequisite for the 
Maxwell-Wagner dispersion is therefore the existence of boundaries between media 
of different electrical properties. Consequently we shall assume that the macro- 
ions and their associated counter ions may be represented by spherical droplets, 
although the actual geometry must tend toward cylindrical as dilution increases. 
Admittedly, the particles of such a pseudo-suspension would have no sharp bound- 
ary, but a fair approximation might be made by using a step function, defining as 
dispersed phase the regions containing polycations plus all nearby gegenions of 
potential energy nkT' (where n is a small integer), and as continuous phase all the 
rest of the solution. Due to the higher concentration of ions in the dispersed phase, 
the latter would have a higher conductance than the bulk of the solution; the 
necessary condition for the existence of a Maxwell-Wagner dispersion is thus satis- 
fied. 

In the case of dilute polyelectrolyte solutions, we may set 


ae eee 
so that 
k = 3p(o, — o2)?/(20; + o2)? (4) 
and 
t = 3e'/4x(201, + 2). (5) 


The constant k is related to the zero frequency dielectric constant, €, by the equa- 
tion 


é = e,(1 + k) 
so that, 


k = (@ — e€,,)/€ 


« 
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We may use the experimental values of Ae’ at a frequency of | kilocycle as a crude 
measure of Ae = (€ — e€,,) and thereby calculate effective values of p for various 
assumed values of (¢2/01) by means of equations (4) and (6). As may be seen from 
table 3, this procedure leads to values of p and (¢2/o,) which are physically reason- 
able. 

By the use of equation (5), we may calculate values of the critical frequency 


f. = 1/227, using the known total bromide concentration, values of p from table 3, 


and various assumed values of (¢2/0,). The resulting values of f, are given in 
table 4. It will be seen that the agreement with the experimental curves of figure | 
is quite good. The effects of increased polymer concentration and of added simple 
salt may be understood in terms of equations (4) and (5). In either case, the con- 
ductivity is increased and the difference in conductivity between the dispersed 
phase and the continuous medium is decreased; this leads to a decreased dielectric 
increment per gram and an increased critical frequency. 


TABLE 3 


CALCULATION OF p AS FUNCTION OF (02/0) 


g Ae (a2 pe ee 10 3 
0.050 10 0.17 0.30 1.05 
0.016 17 0.07 0.12 0.44 
0.005 7 0.03 0.05 0.18 
0.001 2 0.008 0.015 0.052 

TABLE 4 


CALCULATION OF CRITICAL FREQUENCY AS FUNCTION OF (¢2/0;) 


- - ~fe (KC.) — 
q 106 [Br’] 10& ao (02/01) = « 10 3 
0.050 160 12.0 590 310 100 
0.016 52 1.1 190 200 89 
0.005 16 1.2 330 83 37 
0.001 3.2 0.25 260 22 9 


The above discussion has been in terms of a uniform migration of charge within 
each medium due to the applied electric field. Such a migration might be consid- 
ered analogous to the mechanism proposed by Kirkwood and Shumaker,’ except 
that in this case the distribution of sites is a continuum. An alternative explana- 
tion of our results could be the relaxation effects in the ionic atmospheres of the 
polycations. Debye and Falkenhagen'® showed that the conductance of simple 
electrolytes exhibits a dispersion at frequencies corresponding to the time of relaxa- 
tion of the ion atmosphere. Such effects would presumably be enhanced in our 
case due to the high effective charge density of the polyelectrolyte. A dispersion in 
conductance means that x, the specific conductance, is a complex quantity, and the 
imaginary term of x = (x’ — jx”) represents an effective increment in dielectric 
constant (just as the loss factor e” in the complex dielectric constant « = «’ — je” 
can be represented as an equivalent parallel conductance). The fact that our 
observed dispersion region moves to lower frequencies with increasing dilution (i.e., 
with increasing dimensions of ionic atmospheres, and hence longer time constants) 
is in agreement with this mechanism. 
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Whether the observed increments in dielectric constant are primarily due to a 
Maxwell-Wagner dispersion or primarily to a Debye-Falkenhagen effect, or quite 
possibly to a combination of the two, cannot be decided on the basis of the facts 
available at present. The shape of the dispersion curves of figures 1 and 3 does 
indeed suggest the presence of two dispersion ranges, indicating two relaxation 
times (or two most probable values if a distribution of relaxation times is involved). 
Studies must be made on more homogeneous polymers, however, before this can be 
taken as definite evidence for the existence of two mechanisms. Finally, of course, 
we must not exclude the possibility that some hitherto unsuspected effect is operat- 
ing in these systems. 

* Acknowledgment is made to the Office of Naval Research and to the National Institutes of 
Health for joint support of the work reported in this paper. 
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THE RATES OF REACTION BETWEEN ALKYL HALIDES AND 
TERTIARY AMINES AS DETERMINED CONDUCTOMETRICALLY* 


By Rosrerr P. LARSEN AND CHARLES A. KRAUS 
METCALF RESEARCH LABORATORY, BROWN UNIVERSITY 
Communicated December 7, 1953 


The alkyl! halides react quantitatively and more or less readily with tertiary amines 
at ordinary temperatures. The product of reaction is a quaternary ammonium 
salt which normally is soluble in the solvent in which the reaction is carried out. 

Menschutkin' was the first to study the rate of this reaction some sixty years 
ago. He investigated the rate of reaction between ethyl! iodide and triethylamine 
in 19 different solvents at 100°C. Although numerous investigators have studied 
reactions of this type since Menschutkin’s time, accurate data are lacking for re- 
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action rates and we have little knowledge of the influence of various constitutional 
factors of reactants and of solvent on the rates of reaction. 

It seemed worth while to devise a more precise method for determining the rate 
of these reactions and to determine the rate constants for a number of typical re- 
actions. Since the quaternary ammonium salts are strong electrolytes, it seemed 
that a conductometric method might be successfully employed. Preston and Jones? 
employed this method earlier to measure the reaction rates between dimethylani- 
line and allyl and methy! iodides in ethanol at 25°. They found the rate constants 
determined conductometrically to agree well with those determined gravimetrically. 
However, they failed to take into account several sources of error as will appear 
below. 

The conductance of the pure salt, the product of the reaction to be studied, was 
first measured over a concentration range that included salt concentrations as 
produced in the course of reaction. In general, the conversion of reactants to salt 
was less than 0.1%. In most cases, the concentration of reactants varied between 
0.01 and 0.1 N. For a few reactions, where the reaction rate was low, the concen- 
tration of reactants varied between 0.1 and 1.0 NV. 

TABLE 1 
TyprcaL Rate DETERMINATION FOR REACTIONS OF ETHYL IopipE AND TRIETHYLAMINE 
IN ACETONE 
C, EtsN = 3.752 X 1072 


C, EtI = 3.8 <X 10-2 SPECIFIC PRODUCT p = 0.788 
TIME, RESISTANCE, CONDUCTANCE CONCENTRATION AC/ At 
SEC OHMS xX 105 «x 10° x 106 

176 40 ,000 4.00 1.99 a3 
309 24,000 6.66 3.31 9.93 
422 18,000 8.88 4.42 9.89 
552 14,000 11.41 5.72 9.91 
683 11,500 13.89 6.97 9.81 
810 9,800 16.30 8.22 9.81 
943 8,500 18.79 9.54 9.86 
1078 7,500 21.30 10.85 9.81 
1205 6,750 23.66 12.09 9.81 


With reactions of the trialkylamines in acetone, where reactant concentrations 
were low, it was unnecessary to apply a correction for the effect of the reactants on 
the conductance of the reaction product. For the reactions with pyridine, the 
conductance of the salt was measured in the presence of varying amounts of each of 
the reactants. In applying the correction for this effect, it was assumed that the 
conductance change due to the presence of both reactants was equal to the sum of 
the effects of the two reactants when present alone. Solutions of ethyl! iodide in 
the alcohols are not entirely stable, as Norris and Prentiss’ have shown for butyl] 
bromide in butanol. Solvolysis takes place with the formation of the halogen acid 
and the conductance of the solution increases slowly. The rate of this change was 
measured for solutions of ethyl iodide in methanol and ethanol and corrections were 
made to the observed conductance values of the reaction mixtures. 

Conductance measurements of the reaction mixtures, as also of the pure salts, 
were carried out in a cell of the Erlenmeyer type* having a capacity of 125 ce. with 
unplatinized electrodes. The measurements were carried out at 25 + 0.01°C. 
Resistances were measured with a precision bridge having a capacity of 100,000 
ohms. 
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Weighed quantities of solvent and amine were introduced into the conductance 
cell,. the alkyl halide was added, and after mixing the cell was placed in a thermo- 
stat, the resistance of the mixture measured, and the time noted. A new resistance 
was introduced into the bridge circuit and the time was noted when balance was 
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FIGURE 1 


Rate plot for reaction of ethyl iodide and triethyl amine in acetone 
at 25°. 


again reached. Thereafter, a new adjustment of resistance was made and this 
procedure was repeated until the desired amount of reaction had taken place. The 
cell was then weighed to determine the amount of alkyl halide and the density of 
the solution was determined with a Westphal balance. 

The equation for the reaction rate reads: 


dz/dt = KCC, (1) 


where x is the moles of product per liter of solution, ¢ is the time in seconds, K is 
the rate constant, and C, and C, are the concentrations of halide and amine. 
Since the maximum concentration change of the reactants is of the order of 0.1%, z 
is a linear function of ¢ within the experimental error. A plot of the concentration 
of reaction product vs. ¢ should be linear and its slope should yield the value of 
dx/dt, wherefrom the value of the reaction constant K may be computed. 

For the purpose of illustration, details for one reaction between ethyl iodide and 
triethylamine are given in table 1. At the head of the table are given values of the 
concentration of the two reactants and the density of the solution. Time appears 
in column 1, the resistance of the solution in column 2, its specific conductance in 
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column 3, and the concentration of the reaction product in column 4. In the last 
column, are given values of the rate dx/dt in moles per second for the time intervals 
as listed in column 1. There is some variation of rate for the earliest points, possibly 
due to lack of thermal equilibrium. 


TABLE 2 


CONSTANTS FOR REACTIONS OF ALKYL HALIDES WITH TRIETHYLAMINE 


C HALIDE C AMINE C HALIDE C AMINE 
x 102 x 102 K X 105 x 102 x 102 K X 105 
A. EtIl + Et,;N B. EtI + Pr:N 
2.592 2.051 6.98 3.84 3.28 1.53 
1.541 4.751 6.82 4.68 4.82 1.51 
2.439 4.277 6.84 §.32 5.68 1.51 
3.840 3.402 6.81 4.50 6.76 1.50 
6.77 8.17 6.57 7.08 7.65 1.48 
9.21 11.41 6.60 
C. EtI + Bu,N D. EtI + Am:N 
1.930 2.701 2.03 1.85 2.32 2.04 
3.772 2.996 2.03 2.06 2.66 2.10 
4.85 3.69 2.09 3.09 2.57 2.02 
6.01 4.64 2.09 4.00 4.01 2.02 
5.20 6.71 2.07 
EK. Bul + PrN F. Pri + Pr3N 
4.88 5.01 0.238 4.40 5.64 0.276 
5.69 4.47 0.242 4.94 5.39 0.277 
5.36 6.46 0.243 6.49 8.40 0.273 
4.74 7.12 0.240 7.02 8.86 0.269 
G. PrBr + Pr;N H, PrCl + PrN 
6.97 4.80 0.137 5.02 6.97 1.34° 
9.81 5.61 0.139 9.83 6.48 1 .22° 
8.59 7.44 0.134 13.30 6.06 0.90° 
*“K << 10. 
TABLE 3 
REACTIONS OF EtuyL IopiIpE AND PYRIDINE IN DIFFERENT SOLVENTS 
C Etl PYRIDINE C Etl PYRIDINE C Etil PYRIDINE 
x 10 x 10 K X 106 x 10 x10 K X 106 x 10 x 10 K X 108 
A. Acetone B. Methanol C. Ethanol 
0.938 0.735 9.24 2.76 2.58 1.20 1.852 1.584 0.776 
0.923 0.798 9.33 3.066 2.49 1.19 2.483 2.627 0.850 
0.965 0.926 9.21 2.94 3.21 1.20 3.91 3.12 0.882 
1.227 1.290 9.34 3.94 3.58 1.22 3.27 3.56 0.885 
1.318 1.155 9.19 4.45 3.82 1.21 3.28 t.00 0.916 
4.89 4.90 1.24 3.60 5.16 0.966 
4.92 5.75 0.984 
8.09 9.29 1.178 


A plot of the results is shown in figure 1. The value of dr/dt was found to be 
9.82 X 10-* moles per second. This leads to value of 6.81 X 10~° for the rate 
constant K. As stated above, it was unnecessary to make a correction for the 
effect of the reactants on the conductance of the salt for the aliphatic amines in 
acetone. 
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All the materials employed were carefully purified. The tertiary amines® were 
converted to picrates, recrystallized, and reconverted to amine. Pyridine was ¢ 
pure product supplied to us by Dr. C. J. Carignan, of this laboratory.6 Alkyl 
halides were treated with dilute caustic to remove halogens and redistilled after 
drying. Acetone’ was purified by the method of Reynolds.’ The final drying of 
methanol and ethanol was carried out with activated alumina. The specific con- 
ductance of the acetone ranged between 1 and 2 X 107°, that of methanol between 
1 and 0.5 X 1077, and that of ethanol between | and 0.5 X 10-°. 

The results of the present investigation are given in tables 2 and 3, where the 
concentration of alkyl halide appears in column 1, that of amine in column 2, and 
the rate constant K in column 3. Mean values of the rate constants are sum- 
marized in table 4. 

With the exception of two reactions, the rate constant is not dependent on re- 
actant concentration in the concentration ranges employed. For different series 
of reactions with given reactants, the average deviation of K from the mean is of 
the order of 1.1% with a maximum deviation of 1.7% for one reaction. 

TABLE 4 


Rate Constants FoR Reactions or ALKYL Hauipes with TERTIARY ANIONS IN DIFFERENT 
SOLVENTS 


REACTANTS RATE CONSTANT SOLVENT 
Etl + EtsN 6.77 X 10-5 acetone 
EtI + Pr3N 1.51 10 acetone 
Etl + Bu,N 2.05 <X 10> acetone 
EtI + Am;N 2.04 K 1074 acetone 
Bul + Pr3N 2.41 X 1076 acetone 
PrI + Pr3N 2.74 X 107* acetone 
PrBr + Pr;N 1.37 X< 10r* acetone 
PrCl + Pr:N l < lo acetone 
Etl + Pyr 9.26 K 10-* acetone 
EtI + Pyr 1.21 <X 1075 methanol 
tl + Pyr 7 x 107 ethanol 


The value of K for ethyl iodide and pyridine in ethanol increases markedly with 
increasing reactant concentration. In contrast, the constant for the same reaction 
in methanol and in acetone is independent of concentration. Whether the change 
of K in ethanol is due to some unknown source of error or not must remain uncertain. 
The alkyl halides are not stable in ethanol and correction had to be made for this 
effect. However, the same is true of methanol and there the constant exhibits no 
dependence on reactant concentration. 

The constant for the reaction between propyl chloride and tripropylamine in- 
creases markedly with concentration. Here, the reaction was very slow, the con- 
centration of reactants was high and time intervals amounted to several hours. The 
concentration of the product of reaction ranged between 0.6 and 1.0 K 10-5 N. 
Under these circumstances various sources of error might creep into the observa- 
tions. It may safely be concluded, however, that the rate constant of this reaction 
is of the order of 1 X 107°. 

It is of interest to compare the rate of the reactions between tripropylamine and 
propyliodide, bromide, and chloride. The constants for these reactions are, 
respectively, 2.74 X 10~*, 1.37 X 10~*, and 1 X 10~%. While the rate constant of 
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the iodide is only twiee as great as that of the bromide, it is two thousand times as 
great as that of the chloride. ; 

For the reaction of ethyl iodide with tertiary amines, the rate constant decreases 
greatly in going from triethyl to tripropylamine. The constants for tributyl- and 
triamylamine have practically the same value and both are markedly greater than 
that of tripropylamine, 2.05 X 10° and 1.51 X 10~°, respectively. It is difficult to 
account for this inversion. Conceivably, one or another of the amines was im- 
pure; if so, it must have been the propyl! derivative. However, this amine was 
carefully purified and there is no reason for doubting its purity other than the in- 
version under discussion. 

As the alkyl group attached to the iodine atom becomes larger, the rate decreases. 
Thus, for reactions of ethyl, propyl, and buty] iodides with tripropylamine, the rates 
are, respectively, 1.51 X 10~°, 2.74 X 10-°, and 2.41 X 10-*. The change from 
propyl to butyl is relatively small. 

The effect of solvent on reaction rates is of interest. For the ethyl iodide- 
pyridine reaction, the rate constant for acetone (9.26 * 10~*) is markedly greater 
than it is for methanol (1.21 * 10~*) which, in turn, is higher than for ethanol. As 
already mentioned, the constant for ethanol is dependent on the concentration of 
reactants; extrapolating to low reactant concentrations, one obtains the approxi- 
mate value 7 X 10-7. Norris and Prentiss* have determined the rates for the same 
reaction in these same solvents. For acetone, methanol, and ethanol, they report 
the values 10 X 10-*, 1.94 & 10-*, and 1.08 X 10-*, respectively. The values 
are in good agreement with those of the present investigation considering the much 
higher concentration of reactants which Norris and Prentiss employed. If we 
interpolate a value of K for ethanol at a concentration corresponding to that of Norris 
and Prentiss, we find K = 1.07 X 10~* in good agreement with the earlier value. 

There appears to be little or no correlation between the physical properties of the 
solvent and the rate of reaction. The dielectric constant of methanol is markedly 
greater than that of acetone, yet the rate in acetone is much greater than in meth- 
anol. The viscosity of acetone is less than that of methanol but not greatly so. 
Menschutkin! measured the reaction rates of ethyl iodide and triethylamine in 19 
different solvents and found the reaction rate to vary widely for different solvents. 
While Menschutkin’s measurements are subject to rather large uncertainties, they 
nevertheless, show that the rate of reaction is greatly dependent on the solvent 
medium. Further studies of reaction rates in different solvents would seem to be 
called for. Conceivably such studies would throw some light on the detailed mech- 
anism which determines the reaction rate and in what manner such mechanism is 
dependent upon constitutional factors of solvent as well as of reactants. In carry- 
ing out precise measurements, the conductometric method may be recommended. 

* This paper is based on a portion of a thesis presented by Robert P. Larsen in partial fulfill- 
ment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Brown 
University, May, 1948. 
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SONIC POINT ON SUPERSONIC AIRFOILS 
By T. Y. THomas 
INSTITUTE FOR NUMERICAL ANALYSIS, LOS ANGELES, CALIFORNIA 
Communicated December 4, 1953 


1. Introduction.—In figure 1 we have indicated an airfoil in a uniform supersonic 
stream of velocity W. Under the conditions assumed the shock AQ will be de- 
tached from the airfoil and the stream line AB, whose continuation is the contour 
of the airfoil, will be parallel to the direction of the uniform flow and perpendicular 
to the shock at the point A. The sonic line, i.e., the curve along which the mach 
number m = I, is represented by PQ. This line divides the region of the flow 
behind the shock into a subsonic (m < 1) region I and a supersonic (m > 1) region 
II. Itis the object of this note to determine the position of the sonic point P on the 
airfoil.! 

Introducing the entropy S as a parameter along the sonic line we can write 


dw 


wp = we + (“<) (So — So), (1) 
dS] pr 

where w denotes the inclination of the stream lines with wp and we the inclinations 

of these lines at points P and Q respectively. Also Sp is the entropy along the 

stream line A BPC, and hence is the entropy at the point P, while Sg is the entropy 

at Q. The derivative dw/dS in the above equation is evaluated at some point R 

between P and Q on the sonic line. Approximately we have 


dw 


) (So — Sg), (2) 


dS ‘a 


Wp = Wa + ( 


where the derivative is now evaluated 
at the point Q. The expression (2) for 
wp Will be correct to within terms of the 
second order in the entropy difference 
So — Sq and for flows in which the en- 
tropy changes are small it may be ex- 
pected that the relation (2) will give a 
good approximation to the inclination 
wp of the airfoil at the sonic point P. 

The remainder of this paper is de- 
voted to the derivation of the formulas 
for the quantities in the right member 
of (2). It is assumed in this discussion 
FIGURE 1 that we are dealing with an ideal gas 
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and that viscosity and thermal conductivity can be neglected. Computations? are 
carried out in the last section and in this section we have also shown the graph of 
the inclination wp as a function of the mach number M of the uniform stream. 

2. Determination of w2.—The inclination of the stream line at the point Q behind 
the shock is determined directly from the condition that m = 1 at this point. In 
fact we have*® 


sin? @ 


~ ty? ; (vy + DM? + (y — 3)) + V [Cv + DM? + (vy — 3) 22 + i6yt, 
y ¥: - } 
(3) 
where a is the inclination of the shock line at Q and y is a constant having the value 
1.405 for the case of air flow with which we shall be concerned. From (3) we find 
the value of a(<90°) at Q. The inclination w = we is then given by the shock rela- 
tion 


2((? — 1) tan? a — 1] 


nies eae : é 4) 
[(y — 1)M? + 2] tan* a + [(y + 1)M? + 2] tan a ( 


tan w = 
3. Inclination of the Sonic Line Relative to the Stream Lines.—The angle yw in 
figure | gives the inclination of the sonic line relative to the stream lines. Now 
from the fact that m = const. on the sonic line it follows readily that the flow veloc- 
ity v is likewise constant on this line. Letting / denote are length along the sonic 
line measured from P to Q we therefore have 
dv dv y+ dv . J 0 
= COS sin = 9 
dl do dn 
where o is the are length along the stream lines in the direction of the flow and n 
denotes the direction normal to the stream lines as shown in figure 1. Hence 
dv dv | dp 
tany = — = ’ 
dn do pv do 
where p, as usual, denotes the density. Let us now put p = p(w, S), i.e., we take 
w and S as the independent variables in the representation of the pressure function. 
Substituting for the derivative-dv/dn the above equation can then be written 
K Op 
(vK + Ap) tan y = ’ 
pv Ow 


where K is the curvature of the stream lines and A is an invariant of the flow which 
is constant along stream lines. Or, we have 


vy? pv? A | Op 
¥ oh -|tany = . 
yp/p yp/pvKk p Ow 
Hence putting c? = yp/p where c is the velocity of sound and m2 = v2/c? = 1 we 
obtain 
Op/Ow 


te = : = : 
aah yp(1 + D) vk 
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Explicit formulas are given in §6 for the quantity D and in §7 for the quantity 
(Op/Qw)/p at the point Q. Calculations based on these formulas, which have not 
been entered in this note, indicate that tan y can have both positive and negative 
values at Q depending on the mach number M and that this function will become 
infinite only in the exceptional case for which D = —1. Since the relation (5) is 
exact, computations are exact, i.e., it is possible to determine to any degree of ap- 
proximation the inclination y of the sonic line relative to the stream line at the 
point Q where the sonic line meets the shock. 

4. Formula for dw/dS.—It can be shown that‘ 


i l 
(m? — 1) 7 = mie 
do dn 
dS 
- = —(y — I)pA. 
dn 


From the first of these formulas we have dw/dn = 0 along the sonic line under the 
assumption that dp/do does not become infinite; this quantity is definitely finite 
at the point Q, with which we shall primarily be concerned in this note, in conse- 
quence of the following equation (24). Hence along the sonic line we have 


dw dw da dw dn 


- Se es Re. 
os ae ae 


dw A dw dS dn a ( ee da 
di dSdndtS* ome as 


The combination of these two equations now gives 


dS A 
(y - 1 ov ( ) tan y 
K 


A A y2 y2 
puA _ ‘ ) — D wv yD 


K vK/ yp/p c? 


da —1 : (6) 
But, 


along the sonic line. Hence (6) becomes 
dw ee 
v(y — 1D tan y 


5. The Curvature Ratio Go.—Between the curvature K of the stream line and the 
curvature «x of the shock line we have the relation K = Gox at any point behind the 


p Pi ee 22) (Hy = (° a: i)(“) 
* - mA + re v pi v n 4 (“). (8) 


(u,?/c?) — 1 “ | 


shock, where® 


—(G = 
v 


Here the subscript 1 is used to denote a quantity in the uniform incident flow. 
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The tangential velocity component is specified by u, and u, denotes the normal com- 
ponent of velocity behind the shock. Other quantities appearing in (8) have al- 
ready been defined. 

At the point Q where m = | it follows readily that 


(u,?/c?) — 1 = —(u,/c)?. (9) 


Making this substitution in (8) this equation can be reduced to 


2(y — 1) t ( — ] 
liu | (y = (° 1 *)\(“*) 4 [(o/ pr) | ei 
i oe pi p v U,/v 


Now 
[pl/o = (p — pi)/m = (p/m) — 1, (11) 
[p]/p = (p — px)/p = 1 — (p1/p), (12) 
and hence 
[e]?/ pp: = p/pr + (pi/p) — 2. (13) 


Using (11) and (13) equation (10) becomes 
. : 4 lp]? |/u lp|/p 
a-{ + ‘) — = (14) 
Ly + I ppi v U,/v 
We now make use of the following shock condition® 


—2p(pu,” — pp) , 
[p] = eae aes (15) 


Applying this condition at the point Q we have 


a —2p(u,? — c?) Zs —2p[(u,?/c?) — 1] 
PP 2B + (y — Dy? 2+ CY — Du,?/e? 
or, 
[p] 2(u,/v)? 2(u,/v)? 
= oe : a (16 
p 2+ (y¥— 11 — (w/e?) (Yt) -(¥- Dw/v)? .* » 
Now we can write 
fi. I 
p= ls — [p]/p 
and hence 
[op] [pl p [p|/p 7 
= = al (17) 
pi pp =o L — [p/p 


When we replace the quantity [p]/p in (17) by its value as given by (16) we obtain 
the following relation 


[ep] _ 2a* 


8 
a. e+ DG — af (18) 
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where we have put a = u,/v. Also we can write 

lo} _ en 

eo (y+1)—(y¥—- Da’ 
from (16). Now when we make the substitutions (18) and (19) in (14) we find, 
after some reduction, the following relation 


2a Ke — i= & + |: 
fe 


(19) 


Go = 


(y — la? — (y + 1) (20) 


Finally replacing a by its value —cos (a — w) equation (20) can be given the form 


G =_ —Zomle—o) E + 1) tan" (oe — ») + 3 57 ai 
5 y+ 1 (y + 1) tan? (a — w) + 2 “ 
6. The Ratio A/K.—It has been shown that? A4/K = —[p|?u,/p:ppGo at points 
behind the shock. Hence from the second equation (5) we have 
—[p]? fu ; 
Di = — le] ( ‘ = De cos (a — w). (22) 
ppl \v pipGo 
At the point Q on the sonic line we can make the substitutions (18), (19), and (21) 
in the right member of (22). After some reduction we thus obtain 
D = —2/tan? (a — w)[(y + 1) tan? (a — w) + 3 — ¥]}. (23) 


?. Formula for 0p/Ow.—At any point behind the shock line we have® 


op re (“ ) 4pvu, 
— = —g* — oe ° 
Ow 4 Ut (y + l )Go 


Since u, = v sin (a — w) and u, = —v cos (a — w) we furthermore obtain 
1 Op pv? 4 sin (a — w) 
‘iiieatens |... 5. teat. ieeelieewervaiemenne sae ‘a 
p Ow p ¥+ 1 Go 


Hence at the point Q we have 


1 Op [ ( ) 4 sin (a — 2) 
— = mina 
Pp Ow * Y + | GQ 
and then making the substitution (21) for Go we obtain an equation which can be 
written in the form 
1 Op 
p Ow 





es eer el a 


= 7 tan (a - w) | 34 1) tan? (a — w) +3-— 7 


8. The Entropy Difference Sy — Sg.—The entropy S in the non-isentropic flow 
behind the shock is defined by 


P/P1 
= log 


Ny (p pi)” 


At points immediately behind the shock line the ratios p/p, and p/p; are determined 


: \ 
y= log 
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from the shock conditions as follows 


27(M? sin? a — 1) 
p/P~i = (p/pi)e = - + “ + 1, 
2(M? sin? a fue 1) 


¥ 
(y — 1)M?*sin? a+ 2 + 


p/p = (0/ pide = 
By selecting the value of a given by (3) we can therefore find the entropy Sg at the 
point Q on the sonic line. 

The entropy So along the contour of the airfoil is found by taking a = 90° in the 
above expressions for p/p; and p/p; since the shock is assumed to be perpendicular 
to the stream line A BPC at the point A (see figure 1). Hence if we denote the values 
of the pressure and density ratios at A by p,/p; and p,/p; we have 


),/D 
So = log : i 
(py pi)” 
where 
9,  2y(M? — 1) 
f = a ie sy aaa + EF 
Pi a ae 
Py 2(M? — 1) ; 
a &~ D+ 2 , 
The above relations permit the calculation of the entropy difference Sp — Sg which 


occurs in the equation (2). 

9. Computations and Determination of the Sonic Point on the Airfoil—The 
method of determining the quantity we is given in §2 and the determination of the 
entropy difference Sy — Sg is explained in §8. The determination of the derivative 
dw/dS at Q which also occurs in the relation (2) is covered by the formulas of the 
preceding sections. Thus the quantity D is obtained from (23) when the value of 
a has been found from (3) and w = we from (4). These values of a and we likewise 
permit the determination of the quantity in the left member of (24) after which we 
obtain tan y from equation (5). With tan y and D determined we find the value 
of the derivative dw/dS at Q from equation (7). Hence we can determine the varia- 


tion 
iw (**) (Sy — So) 
alii dS Q si o 


in the inclination w and hence the value of wp by equation (2). 

In table 1 we have given the values of the quantities wg, 6a and w, for various 
values of the mach number M of the incident flow. Figure 2 shows the graphical 
relationship between w, and M. 

Remark 1. The representation of the mach number m as a function m(w, S) of 
the independent variables w and S suggests the possibility of taking m = m(w) as a 
first approximation of this quantity where m(w) is the function of w obtained by 
evaluating the mach number behind the shock. This corresponds to the usual 
neglect of the entropy in gas dynamical problems. From this standpoint we are 
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led to the value of we, which is strictly the inclination of the stream line at the point 
where m = 1 behind the shock, as an approximation to the inclination of the con- 
tour at the sonic pomt P. The determination of the end point P of the sonic line 
by the value of we instead of wp as given by (2) would appear to be acceptable as an 
approximation from the fact that dw is generally small in comparison with we. 


TABLE 1 
wQ bw wp 
0.514 147 .661 
1.403 439 .842 
3.691 918 .609 
3.300 302 .603 
9.000 .486 485 
.658 .550 3.208 
.198 . 298 5.496 
577 .072 649 
.80 775 .920 .695 
.790 .412 . 202 
00 22 .626 .059 .685 
29 . 554 —2.802 3.753 
.00 33.872 9.261 28.612 


OR Oe en ee ee ee ee 


Remark 2. The formula in §6 for the quantity A which is constant along stream 
lines can be written in the form 


‘oe [p Puen 
pipp 
Since u, vanishes at the point A in figure | it follows that the quantity A and hence 


D in equation (5) will vanish along the contour of the airfoil. The inclination > 
of the sonic line at P relative to the contour is therefore given by 
Op/Ow dp/deo 
tan y = es ae ? 
YP ypK 


where K is the curvature of the contour at P. 





FIGURE 3 
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Remark 3. In the case of the wedge the end point P of the sonic line will lie at 
the shoulder’ as shown in figure 3 on the basis of the discussion in this note provided 
that the angle wp as calculated from (2) is less than the wedge angle @. For angles 
wp > 4, which will occur for detached shocks when the mach number M is sufficiently 
large, the point P will not be determined on the wedge by the method of this note. 
This raises the question as to whether the shock will not be attached to the vertex B 
of the wedge when wp > 6 so that the case of the detached shock under considera- 
tion does not apply. This is an interesting question and one which might well be 
the subject of a separate investigation. In considering this problem, however, it 
must be borne in mind that the equation (2) for the quantity wp is approximate and 
that the inequality wp > @ must therefore be interpreted only as an approximate 
condition for the occurrence of the attached shock. 


1 Prepared for the Naval Research Laboratory, Washington, D. C. 

? These computations were made on SWAC at the Institute for Numerical Analysis, Los An- 
geles, California. 

3 See T. Y. Thomas, ‘‘The Distribution of Singular Shock Directions,” Jour. of Math. and Phys- 
ics, 28, p. 155 (1949). 

‘ We have omitted the factor Jc, from the second of these formulas since this quantity will can- 
cel from the final results. See T. Y. Thomas, ‘““The Determination of Pressure on Curved Bodies 
Behind Shocks,’’ Communications on Pure and Applied Mathematics, I1I, p. 109 and p. 111 (1950). 

5 See T. Y. Thomas, ‘Calculation of the Curvatures of Attached Shock Waves,” Jour. of Math. 
and Physics, 27, p. 280 (1949). 

6°“The Distribution of Singular Shock Directions,” loc. cit., p. 164. 
7“The Determination of Pressure on Curved Bodies Behind Shocks,” loc. cit., p. 112. 

’See T. Y. Thomas, ‘Distribution of Pressure on Curved Profiles in Supersonic Gas Flow with 
‘ariable Entropy,’’ these PROCEEDINGS, 36, p. 112 (1950). 
* The point P cannot lie at the vertex B of the wedge since the velocity and hence the mach num- 


_ 


ber m will be zero at B for detached shocks. It may also be mentioned that boundary layer ef- 
fects, which are neglected in this discussion, may actually play a significant role in the determina- 
tion of the point P when the value of the angle wp is near to the value of the wedge angle @. 


REAR SHOCK AND PRESSURE ON SUPERSONIC AIRFOILS 
By W. F. Brown* anp T. Y. THomast 
INSTITUTE FOR NUMERICAL ANALYSIS, LOS ANGELES, CALIFORNIA 
Communicated December 4, 1953 


1. Introduction.—Consider a curved airfoil along which the inclination w is a 
monotonically decreasing function of the are length with w = 0 at the point T 
(see figure 1). Let the curve w = const. join the point P, situated between the 
vertex V and the point 7’, with the point Q on the shock which is formed when the 
airfoil is placed in a uniform supersonic stream. A (1, 1) correspondence P<+Q 
between points on this shock line and the segment V7’ of the airfoil is thereby es- 
tablished such that V->V and Qo as P—+T. This correspondence lies at the 
basis of the derivation of a simple formula giving an approximate determination of 
the pressure on the segment V7’ of the airfoil.! It is the primary purpose of this 
note to show that this formula can be extended to give the determination of pressure 
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on the remainder of the airfoil by an analogous consideration involving the shock 
which is formed at the rear of the airfoil. In this connection no attempt will be 
made to treat the problem of the separation of flow. The rear shock therefore has 
been drawn in figure | from the rear vertex FR of the airfoil, as is frequently done, 
although this does not in general correspond to the physical fact. 

In § 3 we have made the assumption, which is at best an approximation, that the 
flow behind the rear shock is identical with the uniform stream in which the airfoil 
is submerged. This assumption leads directly, on the basis of the shock conditions, 
to the approximation formula (7) for the pressure on the segment 7'R. It is seen 
that (7) is identical with the previous formula for pressure derived by a considera- 
tion of the shock conditions relative to the front shock and which therefore strictly 
applies to the determination of pressure on the segment V7’. The discussion in §3 
confirms the extension of this previous pressure formula from the segment V7’ to the 








FIGURE 1 


segment 7'R, or indeed of the more general formulae for pressure determination 
which are derived from considerations based on the front shock and which depend 
on entropy variations in the flow, since it appears unlikely that the flow behind the 
rear shock will differ greatly from the free stream. 

It must be emphasized, however, that the use of these extended pressure formulae 
in no way implies that the flow quantities behind the rear shock are the same as those 
in the free stream in spite of the derivation in § 3. This is fortunate since it is 
shown in § 4 that the existence of the free stream flow behind the rear shock violates 
the entropy condition, i.e., the requirement that entropy cannot decrease across the 
shock front. Actually the entropy condition is also violated under the more general 
assumption of irrotational flow behind the rear shock as shown in § 5. Finally it is 
observed in § 6 that even the general rotational and non-isentropic flow behind the 
rear shock cannot be permitted in view of the natural boundary conditions, i.e., 
that the flow behind the rear shock approaches the free stream flow at infinity. 
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Influences such as viscosity and heat conduction, which are neglected in this dis- 
cussion, may possibly allow the realization of these natural boundary conditions. 
It seems reasonable to assume, however, that for finite regions where viscosity and 
thermal conductivity are negligible the general non-isentropic theory will give a 
satisfactory solution of the flow problem. 

2. The Shock Conditions.—Consider the shock conditions along the rear shock 
in the form 


2(pun? — yp) 


Ua) = (1) 
“es 
—2 af war ; ) 
[p] = os a. (2) 
aed) , 
ee 2p( pu, YP) (3) 


~ Qyp + (y — Ipu,?’ 


where the brackets [ ] have the usual significance and y is the usual gas constant. 
The unit normal vector v in (1) is directed as shown in figure |. Also p and p stand 
for pressure and density immediately behind the rear shock and the quantity u, 
denotes the normal velocity behind this shock. 

Dividing both members of (2) by p we have 


1 Pe Lis aa .. i) 
p v+1\yp 


where the subscript 2 is used to denote the value of a quantity immediately in front 
of the rear shock. Denoting by v the flow velocity immediately behind the shock 
let us put c? = yp/p and m? = v?/c?. Also u, = v sin (a — @), where a and @ are 
the angles shown in figure 1. Then the above equation can be written 

iD 2 


sti a [m? sin? (a — w) — 1] + 1. (4) 


Similarly we deduce from (3) that 
po 2y|m? sin? (a — w) — 1] 


= lj ‘ 
p (y — 1)m? sin? (a — a) + 2 a 


or 
wa 


and also by combining the two equations (1) we can obtain the following relation 


tan w = 


2[(m? — 1) tan? (a — @) — 1] 


F 
l(y — 1)m? + 2] tan*® (a — w) + [(y + 1)m? + 2] tan (a — @) (6) 


between the angles a, @ and the inclination w of the stream line immediately in front 
of the rear shock. 

3. Kzxtension of the Pressure Equation.—Let P’Q’ be a curve w = const. joining 
a point P’ on the segment 7'R of the airfoil to a point Q’ of the rear shock. We 
assume that the rear shock approaches the Mach direction of the free stream as the 
point Q’ approaches infinity. Then the curves w = const. establish a (1, 1) cor- 
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respondence P’ <> Q’ between points P’ on 7'R and points Q’ on the rear shock such 
that R «> Rat the tail of the airfoil. Neglecting entropy changes the pressure p, 
which in general is a function p(w, S) of the inclination w of the stream lines and the 
entropy S, becomes a function p(w) and hence the pressure as determined at Q’ 
by the rear shock conditions will be approximately equal to the pressure at the cor- 
responding point P’. Because of the above (1, 1) correspondence the pressure at 
all points of the segment 7'R will be obtained by this procedure. 

We now assume that the flow behind the rear shock is identical with the flow in 
the free stream. Then if p; and p; are the pressure and density in the free stream 
we have p = pi, p = pi, and m = M where M is the Mach Number of the free 
stream. Hence equation (4) becomes 

a ee er ee (7) 
— : ae a 

The angles a and are positive in equation (7) with a >a. Now under the above 
assumption the angle w is obviously negative since the flow could not otherwise, 
after deflection by the shock through the angle @, acquire the direction of the free 
stream for a shock line having the character of that shown in figure 1. _ It is there- 


fore clear that w = —@. Making this substitution in (6) we have 
tan (—q@) = 
Pe. heat TTT U a (8) 
[((y — 1)M? + 2] tan‘ ae — &) + [(y + IM? + 2] tan (@ — ) 


Now equations (7) and (8), in which the quantity a — @ can be thought of as a pa- 
rameter, express the pressure ratio p2/p; along the rear segment 7'R of the airfoil asa 
function of the inclination —o of this segment. That this formula for the pressure 
is an extension of the formula for the pressure over the segment V7’, as derived from 
a consideration of the front shock conditions, is seen from the fact that the latter 
equation is obtained from the above equations (7) and (8) by replacing the param- 
eter a — ® by a and —a in the right member of (8) by the inclination w of the 
segment VT’. 

4. Entropy Change Across Rear Shock.—Denoting by N the quantity in the rela- 
tion p = Npy the entropy S is defined by the equation S = log N/N, where the 
subscript | refers to the free stream. By the entropy condition S cannot decrease 
in passage across the shock. Actually for compression shocks, which we assume 
in this discussion, the entropy will be greater at points immediately behind the 
shock than at contiguous points immediately in front of the shock.? Hence we 
must have V/N, > | at the point A in figure 1. But N,/N > 1 at the point B on 
the same stream line as A (see figure 1) under the assumption that the flow behind 
the rear shock is the same as the flow in the free stream. Since the value of N is 
constant along stream lines and hence is the same in each of these inequalities we 
have thus arrived at a contradiction. In other words the flow behind the rear 
shock cannot strictly be identical with the free stream flow. 

5. Assumption of Irrotational flow.—Let us now generalize the conditions in § 4 
by making the assumption that the flow behind the rear shock is irrotational. 
Under this assumption it is well known that the quantity N will not merely be con- 
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stant along stream lines but constant throughout the entire flow. But as the point 
()' > ~ along the rear shock the value of N will approach the value N, of this quan- 
tity in the free stream. Hence N = JN, and we again have the contradiction ob- 
tained in § 4. 

6. Natural Boundary Conditions.—Removing the above restrictions let us sup- 
pose that the flow behind the rear shock is rotational and thus has the same general 
character as the flow between the two shock lines in figure 1. Now it is natural to 
assume that as we approach infinity behind the rear shock in the direction of the 
free stream the flow quantities, i.e., the pressure, density, and velocity, will approach 
the values of these quantities in the free stream. But this means that V — N, and 
since N is constant along stream lines it follows that N will have the constant value 
N, everywhere behind the rear shock. Hence the consideration in § 4 again applies 
and leads to a contradiction. We have commented briefly on this state of affairs in 
$i 1% 

* Research assistant under Navy Contract N6ONR-180, Task Order V, with Indiana Univer- 
sity. 

+ Consultant, Naval Research Laboratory, Washington, D. C. 

1 See Brown, W. F., and Thomas, T. Y., ‘‘Limiting Behavior of Pressure Derivatives Behind 
Shocks in Supersonic Gas Flow,” to appear in J. Rat. Mech. Analysis, 1954, §1. Also earlier 
papers by Thomas, T. Y., for example, ‘‘First Approximation of Pressure Distribution on Curved 
Profiles at Supersonic Speeds,’’ these PROCEEDINGS, 35, 617-627 (1949). 

2 See, for example, Thomas, T. Y., ““The Fundamental Hydrodynamical Equations and Shock 
Conditions for Gases,’’ Math. Mag., 22, 187 (1949). It can be inferred from the discussion here 
that V = N;, implies p2 = p; which contradicts the assumption that we have a compression shock. 
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In earlier reports! ? data were presented showing the frequency of genetically 
lethal second chromosomes in natural populations of Drosophila melanogaster in a 
few widely separated areas in the United States. The frequency of lethal chromo- 
somes was so high in these areas, and the amount of allelism among the lethals was 
so low in the three areas in which samples of flies were taken directly from nature, 
that it seemed most probable that breeding units within populations of this species 
average considerably larger than in those of other species of Drosophila which had 
been reported at that time. 

In the earlier studies with D. melanogaster, in all areas except Florida, samples 
were taken only in the early fall, when the natural population was close to peak size. 
Beginning in 1946 samples were taken from the local Massachusetts area at three 
times in the season: The first in June or July as soon as a large enough sample 
(about 100 flies minimum) could be collected within a week at one site to make the 
analysis worth while; the second about a month later; and the last collection as 





88 GENETICS: P. T. IVES Proc. N. A. 8. 


late in the season as possible before the first killing frost in September or October, 
Beginning in 1950 a number of other areas were sampled in the early fall, and the 
Florida site was sampled in the spring of 1951. In most of these samples allelism 
tests were also completed with the strictly lethal chromosomes. 

The only change in method from the earlier work consisted of the substitution of a 
new marker stock, known as Sifter to Drosophila geneticists, to facilitate making 
the test matings. Concurrent tests with Bl/Cy and Sifter in several different sea- 
sons showed no significant difference in lethal frequency (including semi-lethals) in 
the two methods of analysis. 

Table 1 shows the frequency of lethals in different geographic areas, all collections 
from each area being summed.* Each collection site was in a rural or suburban 
area. None was within a large city area. It can be seen that the lethal frequency 
was higher in the three southernmost areas, Virginia, Texas, and Florida, than in 
the other more northerly localities. Lethals include all cases with less than 17% 
of homozygotes, as in the 1945 report. A comparison with the similar data of table 
2 of that report shows that the lethal frequency is about 15% lower in these later 


TABLE 1 


FREQUENCY OF LETHALS IN DIFFERENT AREAS OF THE UNITED STATES 


DATE AREA TESTS LE Io 
1947-1952 Amherst, Mass. 3163 1105 34.9 
1950 New York State 527 170 32.3 
1952 Cannonsburg, Pa. 117 33 28.2 
1951 Wooster, Ohio 166 60 36.2 
1952 Lincoln, Neb. 133 34 25.6 
1951 Pullman, Wash. 138 54 39.1 
1950-1952 Blacksburg, Va. 805 346 43.0 
1952 Austin, Texas 98 41 41.8 
1951 Winter Park, Fla. 131 67 51.1 

Totals 5278 1910 36.2 


samples from each of the three areas represented in both tables, Massachusetts, 
Florida, and Ohio. (In Ohio the two collection sites were in neighboring small city 
areas.) The other areas in the present study are also lower than would have been 
expected from the earlier study. 

Tables 2 and 3 show in detail the change that has taken place in the two areas 
most critically sampled, Massachusetts and Florida. Items 1 and 2 of table 2 are 
from the 1945 report. Item 3 is from the 1947 report but has been corrected for 
the amount of allelism by removing those allelism tests involving lethal chromo- 
somes derived from collected females. All samples in table 2 were collected in 
September or October, and the allelism data are for lethal chromosomes derived 
from collected males. In table 3 the 1940, 1942 data are from the 1945 report. 
In the 1951 Florida sample, females and males were separated soon enough after 
collection so that no increase in frequency of alleles was observed among lethal 
chromosomes derived from the females of the sample. In this case, therefore, the 
allelism data include tests of lethal chromosomes derived from both males and 
females. 

Table 2 shows that apparently the major change in both lethal frequency and 
amount of allelism appeared first in 1947. The frequency dropped 10% from the 
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previous year and the amount of allelism approximately doubled compared to the 
average of the 1938-1946 period. The clearest statistical difference, however, is 
between the averages of the two groups of years, items 5 and 12. For the fre- 
quency of lethals, P is less than .0001, and for the amount of allelism it is approx- 
imately .0013, as determined by chi-square. The variation within each group of 
years is not significant, since the value of P is above .10 for each of the four com- 
parisons. 

Inspection of the data of table 2 shows, however, that there is actually consider- 
able variation in lethal frequency within each group of years, the statistical signifi- 
cance of which may be obscured by the necessarily limited amount of data which 
could be collected each year. Particularly is this true in the 1947-1952 period 
where it looks as though the lethal frequency dropped secondarily in 1949 and came 


TABLE 2 


FREQUENCY OF LETHALS AND ALLELES IN DIFFERENT SEASONS IN SouTH AMHERST, Mass, 

ITEM YEAR TESTS LETHALS o// CROSSTESTS ALLELES % 
l 1938 151 68 15.0 1176 5 0.43 
2 1941 108 64 59.3 None 
3 1945 190 87 15.8 666 3 0.45 
} 1946 79 39 19.3 165 I 0.22 
5 Total 1-4 528 258 48.8 2307 9 0.39 
6 1947 257 101 39.3 1039 9 0.87 
b 1948 162 60 37.0 1219 1] 0.90 
8 1949 198 60 30.3 i223 1] 0.90 
9 1950 163 52 31.9 528 4 0.76 
10 1951 182 68 37.4 1377 17 1.24 
1] 1952 190 ta 37.9 820 5 0.61 
12 Total 6-11 1152 113 35.9 6206 57 0.92 


TABLE 3 


THE FREQUENCY OF LETHALS AND ALLELES AT DIFFERENT TIMES IN WINTER PARK, FLORIDA 
YEARS TESTS LETHALS / CROSSTESTS ALLELES % 
1940, 1942 337 220 65.3 2281 10 0.44 
1951 131 67 51.1 1326 5 0.38 


back up to its 1947-1948 level in 1951-1952. The value of P for the six years as a 
group is .3; for the comparison of the three pairs of years (paired chronologically to 
give two degrees of freedom) it is .07; and for the comparison of 1949-1950, first 
with 1947-1948, and then with 1951-1952 it is .04 and .06, respectively. The P 
values are large enough so that the possibility of this variation being purely random 
cannot be considered unlikely. At the same time they are small enough to render 
more acceptable the alternative interpretation that there was a U-shaped decrease 
and increase in lethal frequency in the 1947-1952 period. There is no suggestion of 
any related inverse change in the amount of allelism observed in this period, perhaps 
because the data are not large enough to demonstrate comparatively small changes 
in this category. It is probably only coincidental that this apparent cycle of change 
occurred at exactly two-year intervals in this period of time. 

The data of table 3 show that the drop in lethal frequency (P is .0007) in the 
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Florida areas was not accompanied by a rise in the observed amount of allelism 
among lethals. In 1951 the lethal frequency and the amount of allelism in the 
Florida area each resembles closely the corresponding average in the Massachu- 
setts area in the 1938-1946 period. 

Some additional information on the Massachusetts population is given in table 4. 
These data show for the 1947-1952 period the average lethal frequency in the first, 
second, and last collections and the average amount of allelism in the first and last 
collections. The data are for five of the six seasons; in 1949 it was possible to 
make collections only in August and October. Inspection of the data suggests that 
in the time between the first and second collections, when the population was ex- 
panding rapidly, there was an increase in lethal frequency, but that in the rest of the 
season there was no change. It also suggests that there was the same amount. of 
allelism among lethals at both phases of population development, expansion and 
peak. 

In five of the six seasons in which the first collection came before August (includ- 
ing 1946) there was an increase in lethal frequency in the second collection; 1952 
was the exceptional year. The increase was not statistically significant in any one 
season. Considering only the chi-square P value of .08, the average increase in 
table 4 is on the borderline of significance statistically. If the increase is real (the 


TABLE 4 
THE 1947-1952 AVERAGE INTRA-SEASONAL FREQUENCIES OF LETHALS AND ALLELES IN SOUTH 
AMHERST, Mass. 


o7, o7 


SAMPLE TESTS LETHALS / CROSSTESTS ALLELES i 
First 726 233 32.1 2628 29 1.10 
Second 550 202 36.7 

Last 954 353 37.0 4983 46 0.92 


author feels that more data are needed to determine that fact) it can probably be 
satisfactorily accounted for by two or three generations of accumulation of new 
mutations.* The increase is not as large as the average increase in Russian popula- 
tions.° Goldschmidt has reported no increase during the expansion phase of 
Jerusalem populations.’ If new mutations are the cause in the present case, one 
must assume an earlier elimination of lethals in the comparatively dormant season, 
or in the earlier part of the breeding season than is represented in the first collec- 
tions, due either to inbreeding or selection. There are no data on the amount of 
allelism at actual breeding sites in the local area. 

The complete picture is not given with respect to allelism in the simple averages 
of table 4. In some seasons it was possible to test for allelism within and between 
three or more collections per season. The details of this study will appear else- 
where. They will show that each season probably differs from the others in the 
nature of the breeding units which send flies to the collection site. The picture is 
much more dynamic and complex than the averages given here suggest. The 
present data do show, nevertheless, that in this group of seasons the average amount 
of allelism among the lethal chromosomes was approximately the same in a potential 
gene pool at the time when the population was in its stage of rapid expansion as it 
was a little later on when the peak of population size had been reached. 

It is apparent from the data of tables 1, 2, and 3 that there was a significant 
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change in the genetic structure of natural populations of D. melanogaster in Americé 
during the past decade. One simple interpretation is that there has been a de- 
crease in population size, in the number of pairs of individuals which constitute the 
average breeding unit. This would cause an increase in the amount of inbreeding, 
which would in turn be reflected in an increase in allelism and a decrease in the fre- 
quency of lethal chromosomes. That description fits the data of the local area. 
Another possibility is an increased selection against lethal heterozygotes, which 
would lower the lethal frequency without necessarily changing the amount of allel- 
ism noticeably, such as appears to be the case in the Florida data of table 3. There 
is no conclusive evidence concerning these interpretations. 

It is possible that in the most southerly regions D. melanogaster is suffering from 
competition with its closely related species, D. simulans. With collection condi- 
tions as alike as it was possible to make them, the sample of 1942 from Florida con- 
tained an estimated 15% of simulans in about 300 males, while in the 1951 sample of 
990 flies there was 86% simulans, with no difference in percentage of simulans in 
males and females. The small sample from Austin, Texas, in 1952 had 40% simu- 
lans in the 116 males of the collection. D. simulans was very infrequent in the 
collections from other areas. It has appeared locally only in one or two males per 
season in late summer collections. 

The causative agent for the presumed decrease in size in the northerly populations 
of D. melanogaster during the 1940’s is not known. The change in climate to a 
warmer and drier summer, generally, has not been sufficient locally to interfere 
noticeably with the presumed food source of the species, decaying fruits, vegetables, 
and garbage, at least so far as can be seen by casual observation. The introduction 
of new fungicide and insecticide sprays in orchards and buildings may be a factor. 
So far as can be determined there was no change in spraying program in 1947 as 
compared to 1946 but several new sprays were introduced in subsequent seasons. 
These might have contributed to the apparent population drop in 1949 and 1950. 
Whatever the cause the species seems to have adapted itself to the situation and 
may have improved its position in 1951 and 1952. 

It seems obvious that experiments with large continuous cage populations may 
suggest the nature of the influence of certain environmental agents on the genetic 
factors of natural populations, beginning with some of the genetic factors already 
under investigation.7~"" At the same time it should perhaps be emphasized that 
in a natural population the combination of important variables and effective agents 
is probably so complex that only very general applications can be made of the find- 
ings from studies of artificial cage populations. It is not possible to “experiment” 
with such variables and agents and at the same time maintain truly natural popula- 
tions. Much remains to be done of value, however, at the descriptive genetic 
level in the study of natural populations of Drosophila melanogaster. 

‘Ives, P. T., Genetics, 30, 167-196 (1945). 

2 Ives, P. T., Evolution, 1, 42-47 (1947). 

3 The author is indebted to the following persons for collecting flies: Dr. Mary Alexander, 
Dr. R. P. Levine, Dr. M. Levitan, Dr. D. D. Miller, Dr. R. Moree, Mrs. H. H. Plough, Mr. M. 
Seiger, and Dr. W. P. Spencer. He is also grateful to Dr. B. Wallace for the use of his unpublished 
data from three samples of 449 chromosomes which are included in the New York State sample 
after the individual] tests had been reclassified by the standard for lethals used in the other samples 


of the present study. 
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4 Ives, P. T., Evolution, 4, 286-252 (1950). 

5 Dubinin, N. P., Genetics, 31, 21-38 (1946). 

6 Goldschmidt, Elizabeth, Drosophila Information Service, 26, 102-103 (1952). 
7 Wallace, B., Evolution, 4, 172-174 (1950). 

8 Wallace, B., and King, J. C., these PRocEEDINGs, 38, 706-715 (1952). 

® Wallace, B., and Madden, Carol, Genetics, 38, 456-470 (1953). 

1 Levine, R. P., and Ives, P. T., these PRocEEDINGS, 39, 817-823 (1953). 
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A number of cases have been reported in Drosophila'* and Aspergillus,” 
apparent cases in Neurospora* ' and cotton,!! in which recombination occurs be- 
tween mutants which otherwise classify as allelic by the usual phenotypic and 
genetic criteria. Such mutants have been designated as pseudo-alleles. In addition 
to recombination, pseudo-alleles manifest a differential phenotype depending upon 
whether the mutants are compounded in coupling or in repulsion. Thus in a case 
of pseudo-alleles which can be designated as m, and me, both of which are recessive 
to wild-type, the phenotype of individuals of the genotype m,; +/+ mz is mutant, 
while that of individuals m, mz/+ + is wild-type. Alternative explanations have 
been proposed to account for this situation. On the one hand, it has been reasoned 
that pseudo-alleles represent separate genes whose loci and functions are distinct. 
The phenotypic differences alluded to are then interpreted as being the consequence 
of position effect." * 4 Alternatively, it has been argued that pseudo-alleles are 
components of a larger “‘physiological’’ gene, whose functions are indistinguishable 
but whose component parts are separable by crossing-over.® '? The phenotypic 
differences associated with pseudo-alleles are therefore those expected according to 
the usual rules of dominance. 

These alternative interpretations imply conflicting definitions of the gene. On 
the one hand the gene as defined in terms of linkage relations, mechanism of action 
and mutability is one and the same. On the other the limits determined for the 
gene in terms of linkage do not necessarily coincide with the limits of the gene in 
terms of function. 

It is the purpose of this report to discuss a case of pseudo-allelism at the vermilion 
eye color (v) locus in D. melanogaster and to determine what bearing it has on the 
interpretation of pseudo-allelism and the gene. 

Phenotypic Differentiation of v Mutants.—In an earlier discussion of several sex- 
linked, recessive v mutants of D. melanogaster, it was noted that while they are 
essentially phenotypically indistinguishable, it is possible to separate them into 
two classes on the basis of their phenotypes in the presence of a non-allelic sup- 
pressor mutant.'* Certain of the » mutants (collectively designated as v") are 
suppressed, while others (collectively designated as v") are unsuppressed. It was 
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also noted that v* and v“ mutants react in identical fashion in gynandromorphs and 
when developing larvae are supplied either formyl-kynurenine or kynurenine.'* 
Under the latter conditions both types of v mutants synthesize brown eye pigment. 

Additional data are now available which serve for further comparison of the v 
mutants. It has been known that v° mutants when subjected to partial larval 
“starvation” synthesize brown eye pigment.'' Tests of the effects of larval 
“starvation’’ on the eye color of v“ mutants have been performed and demonstrate 
that they fail to synthesize brown eye pigment under these conditions," thereby 
presenting a second means of distinguishing between v mutants. It has also been 
reported that v° mutants raised in an optimal nutritional regime accumulate non- 
protein tryptophane.'® A comparison of the non-protein tryptophane contents of 


TABLE 1 TABLE 2 
COMPARATIVE PHENOTYPIC RESPONSES OF THE TEstTs OF v MUTANTS TO SUPPRESSOR, su2-s 
Two CLASSES OF v MUTANTS ¥m, : 1s 
ORIGIN OF SUP- UNSUP- 
sie MUTATION PRESSED PRESSED N 
TRY PTO- 

t LARVAL FORMYL- PHANE . ? P : 
MU- SUP- “STAR- KYNUR- KYNUR- ACCUMU- Spontaneous 3 3 6 
TANT PRESSOR VATION’ ENINE ENINE LATION X-rav and gamma 0 15 15 
y as ste nbs a yes Ultra-violet 0 I l 
y" = “= a + yes 
+ = brown pigment formed; — = no 

brown pigment formed. 
TABLE 3 


CROSSING-OVER BETWEEN v! aNnp v°/ (EXPERIMENTS 1-3, 2 2 witH FREE X CHROMOSOMES: 
IXPERIMENTsS 4 AND 5 9 9 wirH ATTACHED-X CHROMOSOMES) 


- ———-GENOTYPE OF 9 9—— — 
X chromosomes; - ——Autosomes*——————_ vt g' Q N 
1. v//ras? vi mf; Cy/+; In(3)CXF, D/+ Lie =e ey 34,556 
2. v/ras? vt mf; Cy/+; Sb, In(SR)C/+ ld =vtmf 33,600 
3. yp! /ras? vl m f; Cy/+; In(3)CXF, D/Sb, In(8R)C 0 37,448 
4. ras? v} f /y36/ mf; Cy/+; Uba/+ 19 =otf 45,651 
5. ras? pl f /ya6! mf; Cy/+; In(3)T, Me/+ 399 =vtf 23,951 


* See Bridges and Brehme?? and Lewis® for description of autosomal inversions 
4 I . 


v and v“ mutants cultured under such conditions shows that they accumulate like 
amounts of non-protein tryptophane, and it is not possible to distinguish between 
them on this basis.'* ‘Table | comprises a summary of the several phenotypic and 
biochemical comparisons which have been made with the v mutants. It may be 
noted that only the suppressor and larval “‘starvation”’ serve to distinguish among 
the » mutants. 

Crossing-over Between v Mutants.—It was suggested earlier in view of the separa- 
bility of » mutants on the basis of the suppressor that the v locus is pseudo-allelic.' 
This possibility has been tested, using the mutants v' (suppressed) and v*” (un- 
suppressed). In these experiments the mutants raspberry-2 eye color (ras?), 0.2 
unit to the left, and miniature wing (m) 3.1 units to the right of the v locus, were 
used as crossing-over markers. Females of the genotype ras? v! m/v*” and hetero- 
zygous for several autosomal inversions were constituted and crossed to ras? v! m 
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oo. The results of these crosses are listed in table 3. It may be noted that two 
vt m oo were recovered. The occurrence of v* individuals associated with 
crossing-over is indicative of pseudo-allelism. This may be best accounted for here 
by assuming that v! lies just to the left of v*” such that the genotype of the parental 
© 2 was ras? vy! +m/+ + vo +. A single crossover occurring between v! and v* 
would yield + + + mor ras? v! v” individuals. 

It may be noted that on the basis of pseudo-allelism, in addition to the v*+ m 
individuals recovered from Q 2 ras? v'! m/v*’ the complementary crossover type, 
viz., ras® v' v* should oecur. There-was no evidence for the occurrence of this type 
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FIGURE 1 
Scheme for detecting crossing-over between pseudo-alleles v! and vo in attached-X @ 9. 
The upper diagram represents the genotype of the attached-X 9 9. The reciprocal crossover is 
designated (a), the non-reciprocal (b), and the corresponding crossover products shown below 


are designated (a) and (b). 


in the initial crosses. In fact, it appears highly unlikely that individuals of the 
genotype ras? v' v* could be differentiated phenotypically from @ Q ras? v! or 
ras? v, Since both the v! and v*’ mutants are incapable of utilizing tryptophane, 
it appears reasonable to conclude that the mutants coupled together on the same 
X chromosome would also fail to utilize tryptophane. Accordingly, it would be 
expected that the phenotype of individuals homozygous for v! v*/ would be in- 
separable from that of individuals homozygous v! or yr 

There does exist at least one method which permits the recovery from 9 9 
v'/y of both products of crossing-over between the pseudo-alleles. This involves 
crossing-over in attached-X 9° 9. This technique was first utilized for recovering 
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simultaneously both products of unequal crossing-over in Bar eye in D. melano- 
gaster,"’ and has more recently been used to establish pseudo-allelism at the white 
locus in D. melanogaster.» Accordingly attached-X 92 @ of the genotype ras? v'! + 
f/+ + vo m f were constituted, autosomal inversions introduced, and these Q @ 
were then crossed tgu f cc’. The 2 progeny of these crosses were examined and 
vt f 9 2 were sought. In figure 1, the genotype of the attached-X @ @Q is illus- 
trated; in addition, the products of crossing-over between the v pseudo-alleles are 
listed. As indicated in figure 1, three products of crossing-over between the 
pseudo-alleles can occur, but only the two producing v*+ phenotypes are detectable. 
The v+ phenotypes will occur as a result of either a reciprocal or a non-reciprocal 
crossover. It may be noted that v+ 2 2 occurring as a result of a reciprocal cross- 
over between the pseudo-alleles will possess both chromatids involved in the cross- 
over, i.e., Will have one X chromosome carrying the wild-type alleles of the v pseudo- 
alleles while the homologous X will carry the two mutants pseudo-alleles. These 
2 2 will be of the genotype indicated for type (a) in figure 1 and will be pheno- 
typically v+ f. That such is the case can be determined by a progeny test. If the 
vt+f Q is the result of a reciprocal crossover, there should occur among her attached- 
X @ progeny individuals of the phenotype ras? v m f. These, the result of a sub- 
sequent non-reciprocal crossover occurring to the right of m in the v* f 2, represent 
the individuals homozygous for both pseudo-alleles and each X chromosome is 
genotypically ras? v! v*! m f. 

The vt f 2 @ occurring as a result of non-reciprocal crossover between the 
pseudo-allelic loci will possess one crossover chromatid and one non-crossover 
chromatid. Their genotype corresponds to (b) of figure 1 and also will be pheno- 
typically v+ f. On progeny testing, these. 2 2 should produce among the attached- 
X @ progeny the phenotype v*’ 
crossover occurring to the right of m in the vt f @. 


mf. This occurs as a result of a non-reciprocal 

In the initial tests made as listed in 4 of table 3, one v+ f attached-X 2 was 
recovered. On progeny testing, this 2 produced a total of 113 attached-X @Q 
progeny, of which 105 were phenotypically v+ f and 8 ras? v m f. Since the eye 
color of ras? vy! v*/ mf Q 9 is inseparable from that of 2 9 ras?v! and since the 2 Q 
carry the markers expected for a reciprocal crossover, it may be concluded that these 
attached-X @ Q are genotypically homozygous ras? v! v*’ m f. The attached-X’s 
of 2 2 ras? v' v! m f were detached and tested to the suppressor. As expected, 
the suppressor failed to alter the eye color. Thus it can be concluded that the 
crossover carrying the two pseudo-alleles has been recovered. 

Three additional cases of crossing-over between the v pseudo-alleles in attached- 
X 2 2 have been recovered in an experiment where a different combination of autos- 
omal inversions was used. These are listed in 5 of table 3. 

Cytology of the v Locus.—In a number of cases of pseudo-allelism in D. melano- 
gaster, evidence has been submitted which points strongly to the association of the 
pseudo-alleles with a doublet as observed in the salivary gland chromosomes.‘ To 
determine whether such an association exists in the case of the v pseudo-alleles four 
independent v deficiencies were examined cytologically. While the limits of the 
deficiencies varied, all had one feature in common, viz., the absence of the doublet 
10A 1-2 as listed in Bridges’ revised salivary gland X chromosome map." The 
association of the v locus with a cytologically visible doublet supports the interpreta- 
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tion of pseudo-allelism, but it cannot be concluded with absolute certainty that v! 
is localized in the left half (10 Al) and v*” in the right half (10 A2) of the doublet. 

Mutation to v* and v“.—In the course of this study a total of 22 independently 
occurring v mutations have been assembled and tested to the suppressor. These 
mutants, a number of which were contributed by other investigators, all arose from 
v* and include both spontaneous and induced mutants. In table 2 the results of 
the tests of the mutants with the suppressor together with their mode of origin are 
summarized. It is significant to note that whereas among the spontaneous mutants 
both v* and v“ types occur, only v“ mutants have been found in the induced group. 
Or phrased slightly differently, while v* mutants occur spontaneously, they have 
thus far failed to occur among the induced v mutants. Does this observation mean 
that there is a difference in induced mutability between the pseudo-allelic v loci? 
This question cannot be conclusively answered at this time for a number of reasons. 
The data on crossing-over between the pseudo-alleles have been derived from 
spontaneously occurring mutants. Thus far crossing-over between a v* mutant 
and an x-ray induced v“ mutant has not been observed although admittedly ex- 
tensive trials have not been made. Two x-ray v“ mutants were tested with v! in 
attached-X @ 9, paralleling experiment 4 of table 3. In each case no vt+ Q Q 
were found among 40,000 progeny. These negative results are not conclusive. 
If the observed crossing-over between v! and v*” is a reasonable approximation of 
the frequency of crossing-over between the pseudo-alleles (see table 3), the negative 
results noted can readily be accounted for on the basis of chance. The relatively 
low frequency of crossing-over between v! and vo’ has for the time being discouraged 
large scale testing for crossing-over between other v mutants. 

If it is assumed that the suppressor does distinguish between the pseudo-allelic 
loci, it is possible to interpret the origin of induced v“ mutants in one of three ways: 
(1) All induced mutants are “point”? mutations allelic to v*’, (2) All induced 
mutants involve botk. loci either as small rearrangements or ‘‘point’’ mutations at 
both loci. (3) Induced mutants include both types (1) and (2). 

The available genetic and cytological information indicates that none of the 
induced v“ mutants is associated with gross chromosomal rearrangement. While it 
cannot be stated conclusively at this time whether induced v“ mutants involve 
both pseudo-allelic loci, there is evidence available from studies of lozenge (lz) 
pseudo-alleles® '* that /z mutants rarely if ever involve two loci simultaneously. 
Analysis of eight independent x-ray induced lz mutants demonstrates that each can 
be assigned, on the basis of crossing-over, to one of the three /z loci. If the induced v 
mutants are homologous to the induced lz mutants in that only one locus mutates 
at a time, and if the suppressor does distinguish between the pseudo-allelic loci, 
then the results noted demonstrate differential mutability of the v pseudo-allales. 
It is pertinent to point out here that the observations on induced mutation are not 
peculiar to the v pseudo-alleles. In a study now in progress!® on the forked (f) 
locus in D. melanogaster, it has been shown that it is possible to distinguish be- 
tween f mutants by using a non-allelic suppressor mutant; pseudo-allelism is indi- 
cated for the f locus since crossing-over occurs between suppressed and unsup- 
pressed f mutants; and all induced f mutants studied are unsuppressed. The 
analysis of the f locus will be discussed shortly. 

Discussion.—The foregoing account of the genetics and cytology of mutants at 
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the v locus in D. melanogaster has established the existence of two pseudo-allelic v 
loci. It is of importance to reconsider here the bearing this case of pseudo-allelism 
has upon the general interpretation of pseudo-allelism. As emphasized at the out- 
set, the specific question which the phenomenon of pseudo-allelism poses may be 
stated as follows: do pseudo-alleles represent closely linked genes with distinctive 
functions or do they represent an integrated functional unit of the chromosome 
whose component parts can be separated by means of crossing-over? At first 
glance, it would appear that the latter interpretation more closely fits the v pseudo- 
alleles. The phenotypic effects of the several v mutants alluded to previously 
indicate that all v mutants are incapable of utilizing tryptophane, the initial step in 
the biosynthesis of brown eye pigment. This is borne out by the observations that 
all v mutants accumulate non-protein tryptophane and respond similarly by synthe- 
sizing brown eye pigment when supplied either formyl-kynurenine or kynurenine. 
Thus, it would appear that v mutants are functionally inseparable. There are, 
however, two observations which militate against an interpretation of functional 
identity among the several » mutants studied. These are the formation of brown 
eye pigment by some but not all mutants, either in the presence of a non-allelic 
suppressor or when subjected to partial larval “‘starvation.’’ It should be added 
here that there is a slight but consistent phenotypic difference which the experienced 
observer can use to distinguish between v mutants. The phenotypes of v* and v“ 
individuals can be distinguished when in addition they are homozygous for the 
autosomal eye color mutant brown (bw). Thus while the eye color of v“; bw flies 
is essentially white that of v*; bw is not and a discernible quantity of brown eye pig- 
ment is formed. 

The role of the suppressor mutant has significance for the interpretation and 
meaning of pseudo-allelism. As noted previously, the suppressor mutant su?-s not 
only suppresses some v mutants but also the sex-linked recessive body color mutant 
sable (s). A number of alleles of su?-s have been reported ;”° one has been described 
as suppressing not only v but the recessive, second chromosome eye color mutant 
purple (pr), while another has been described as suppressing the recessive, second 
chromosome mutant speck (sp). Accordingly, appropriate tests were made to 
determine whether su?-s suppresses pr and sp. The results obtained demonstrated 
that sw?-s suppresses pr and sp in addition tov and s._ An allele of su*-s, su’-v, also 


’ 


was found to suppress pr and sp. 

These observations on the action of the suppressor support the contention that 
the v* and v“ mutants have separate, distinctive actions. It is pertinent to point out 
here that pr eye color mutant is autonomous,”! in contrast to both the v* and v“ mu- 
tants which are non-autonomous in development.'* *!_ In addition pr flies do not 
accumulate non-protein tryptophane.'® Thus the suppressor acts to suppress other 
mutants, but fails to suppress v“ mutants which superficially appear to be closely 
related in their mode of action to the v’ mutants. It therefore appears highly un- 
likely that the v° and v“ mutants can be identical in their modes of action and yet be 
separable in their reactions in the presence of the suppressor. Since the mutants v! 
and »* have been separated by crossing-over, by the suppressor, and by larval 
“starvation,’”’ the conclusion can be drawn that these mutants constitute different 
genes with distinctive loci and functions. These findings for the v pseudo-alleles are 
in accord with those noted for the bithorax-bithoraxoid pseudo-alleles* which can be 
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separated not only by crossing-over, but in addition on, the basis of their distinctive 
phenotypic effects. In addition, the case of the inositolless mutants in Neurospora, 
which can be distinguished by means of a suppressor and which apparently cross 
over, parallels the results reported here for the v pseudo-alleles.* ?? 

The question of the basic difference in the mode of action of the mutants v' and 
v* cannot be satisfactorily answered at this time. It has been noted that both 
mutants are blocked biochemically at the initial step in the biosynthesis of brown 
eye pigment and thus both accumulate non-protein tryptophane. The investiga- 
tion of the genetic control of tryptophane desmolase in Neurospora is a case in 
point.2*> Here, it has been ably demonstrated that two allelic mutants, separable 
by an independent suppressor, both influence the same enzyme activity. By 
analogy, the case of the v pseudo-alleles may also represent a case where the same 
enzyme system is altered by different mutant genes in different ways. The fact 
that v* flies can synthesize brown eye pigment in the presence of the suppressor or 
when subjected to larval “starvation” suggests that they may, in fact, possess the 
necessary enzyme system which catalyzes the oxidation of tryptophane but that the 
particular enzyme(s) is either inactive or is inactivated by the v* gene or its products. 
The suppressor or larval “starvation’’ thus serve to activate the enzyme(s) or to 
remove its inhibitors. In the case of the v“ mutants it would appear that a thus far 
irreversible alteration in the enzyme system has occurred such that it is incapable of 
catalyzing the oxidation of tryptophane. It must be emphasized that the foregoing 
interpretations of the mechanism of action of the v* and v“ mutants are tentative. 
While there is no a priori reason to believe that the same enzyme system cannot be 
altered by different genes and therefore in different ways, there is as yet no informa- 
tion available which permits a critical discrimination among the possible mecha- 
nisms which might be operative here. 

What interpretation can be submitted to explain the differential phenotype 
observed when the pseudo-alleles are in coupling versus repulsion? Since all v 
mutants are recessive to wild-type it readily follows that individuals of the coupling 
genotype v! v*//+ + should be wild-type in phenotype. However, the fact that 
individuals of the repulsion genotype v! +/+ v*® are vermilion in phenotype clearly 
fails to follow the expectation based on dominance of the wild-type alleles. Thus 
any explanation of phenotype based on the assumption of simple dominance and 
recessivity fails. Rather it appears possible to interpret the resultant phenotypes 
in terms of the primary action of the mutant genes themselves along the lines 
previously discussed by Lewis.‘ It is assumed here that the primary action of both 
v' and v*” mutants involves a block at tryptophane, each accomplishing the block in 
a different way. The interpretation of the phenotypes of the mutants in coupling 
and repulsion must include the assumption that the primary gene-controlled reac- 
tions are here confined to the immediate vicinity of the gene. Moreover, it must be 
assumed that the immediate reactions governed by the genes on one chromosome 
are independent of those on the homologous chromosome. ‘Thus in the case of 
individuals of the genotype v! v°%/+ +, the utilization of tryptophane is blocked by 
the v mutants on the one chromosome, but this block does not impede the action of 
the + alleles on the homologous chromosomes in utilizing tryptophane and thus 
brown eye pigment is synthesized. However in the case of the individuals of the 
genotype v! +/+ v®, each mutant pseudo-allele acting independently blocks the 
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utilization of tryptophane and thus prevents the biosynthesis of brown eye pigment. 
Thus while the wild-type alleles of each pseudo-allele are present, their effects are 
rendered ineffective by the action of the adjacent v mutants and the resultant 
phenotype is vermilion rather than wild-type. 

Summary.—A phenotypic comparison between two types of v mutants in D. 
melanogaster is presented. 

Pseudo-allelism at the v locus is indicated since crossing-over between the mutants 
v' and v*” was obtained. 

Cytologically the v loci appear to be associated with the doublet 10 A1-2 in the 
salivary gland chromosome. 

The bearing of these results on the interpretation of pseudo-allelism and the 
nature of the genes involved is discussed. 
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NUMERICAL PREDICTION OF CYCLOGENESIS* 
By J. G. CHARNEY 
THE INSTITUTE FOR ADVANCED STUDY 
Communicated by J. von Neumann, November 16, 1953 
1. Introduction.—In a recent article! the writer and N. A. Phillips devised a 
sequence EZ, of mathematical models of the atmosphere capable of predicting the 
geopotentials of n suitably selected isobaric surfaces, and then presented the 
results of a series of consecutive 24-hour numerical integrations that had been 
carried out with the models #; and #» for an intensely cyclogenetic situation, 
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The predictions were moderately accurate during the period immediately preceding 
the storm but deteriorated markedly in accuracy after its onset. Although 
a rudimentary type of development was observed to take place in the second 
model, in neither case could it be said that the cyclogenesis had been predicted. 

These results were not unanticipated. The model #; was a single layer auto- 
barotropic atmosphere in quasi-hydrostatic and quasi-geostrophie equilibrium. 
Such an atmosphere contains no effective source of kinetic energy and permits 
no individual change in vorticity. The model F», corresponded closely to an 
atmosphere consisting of two superimposed autobarotropic layers. An atmosphere 
of this type does permit potential-kinetic energy conversions as well as individual 
vorticity changes but does not permit horizontal variations of static stability and 
consequently imposes a severe artificial constraint upon the motion. For example, 
it cannot simulate the undoubtedly important effects of the low-level thermal 
asymmetries associated with a front. 

At the same time it was not obvious that the above-mentioned shortcomings 
had been solely responsible for the failure to predict the cyclogenesis. Defects of 
a more fundamental nature might have been inherent in the postulates underlying 
the entire sequence of models. The fault might have resided in the quasi-geo- 
strophic approximation, or possibly in the failure to take into account non-isen- 
tropic changes. It was even conceivable, had the view been held that the cyclo- 
genesis was an instability phenomenon of the conventional type, that the error might 
have been due to the practical indeterminacy of the initial perturbation. 

Since the three-level model EF; is relatively free of the first class of defects but 
not of the second, numerical integrations with this model for the same periods 
could be expected at least to localize the sources of error. Such integrations were 
accordingly programmed and carried out on the Institute for Advanced Study 
(LAS) computer. The integrations showed conclusively that the errors had been 
of the former variety, for in fact the initiation and subsequent development of the 
storm were predicted. As this was perhaps the first instance of a purely mathe- 
matical prediction of cyclogenesis, it is believed that the presentation of the results 
and a discussion of their implications will be of general interest. 

2. Description of Model.—The basic premise used in the construction of the 
models £,, is that the motion of the atmosphere departs only slightly from solid 
rotation with the earth. If the horizontal scale of the motion is sufficiently large, 
it then follows that the flow will be in quasi-hydrostatic and quasi-geostrophic 


equilibrium, 1.e., 


Od 
ap T= (1) 
and 
fuxXk+vVo20. (2) 


Here the pressure p is the vertical coordinate, ¢ is the geopotential, a the specific 


—> . . . ——. . . 
volume, v the horizontal relative velocity component, k the upwardly pointing 
vertical unit vector, V the horizontal gradient operator, f = 2Q sin ¢g, where Q is 
the earth’s angular velocity and ¢ is the latitude. If it is also assumed that the 
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flow is isentropic and non-viscous, it may be shown that the motion is governed 
entirely by the scalar conservation equations for entropy and potential vorticity 
q.2 From these the equations for the sequence of models F,, are derived by a 
series of transformations and finite-difference approximations. If the earth is 
mapped conformally onto a plane with the cartesian coordinates x and y, and m 
is the seale factor, the equations take the form 


O = O”d: O(m?f~ Ok ob f; dx) 
»Viq = a —— == (3) 
(2 T )n ot O(x, y) 
dk = V "dx + pe A br (k a I, 2, ae nN), (4) 
r = 1 


in which ¢; is the sole dependent variable and the matrix (A,,) is constant and 
negative definite. 

The boundary conditions for a region R in the plane bounded by the simple 
closed curve C are that the ¢’s must be specified as functions of time on C and the 
q's at those and only those points of C where fluid is entering R. Initially ¢ must 
he specified everywhere within and on the boundary of R. 

The indices k refer to the isobaric surfaces 


~~ f 
n 


where po is the mean surface pressure. Assuming po = 1000 mb we get for n = 3: 
pi = 167 mb, po = 500 mb, and ps; = 833 mb. In practice it is possible to calculate 
the ¢-values at these levels from those at the standard levels 200, 500, and 850 mb, 
by adding standard ¢ differences. Thus ¢(833) & $(850) + standard (¢(833) — 
¢(850)). If a different choice of levels is wanted, one has merely to introduce a 
monotonic transformation ¢ = o(p) in the derivation of the equations (3). The 
new standard levels are then found by solving 
oe P 
o(Pr) = - [o(po) — o(0)] 

for p,. The transformation ¢ = p? is particularly convenient. Taking po = 1000 
mb again, one finds p, = 408 mb, po = 707 mb, and ps; = 913 mb, and one may 
use data at the standard levels 400, 700, and 900 mb. These give better definition 
to the low-level motions. 

3. Method of Solving the Prediction Equation.—To solve the equations (3) and 
(4) we take as given the fields of ¢, and q, at times t — At and ¢ at the points of 
the rectangular net x = 7As, y = jAs (¢ = 0, ..., 1; 7 = 0, ..., m). The local 
time derivative 0q,/Ot at time ¢ is evaluated from a finite-difference analog of (3) 
and gq, at time ¢ + At is obtained from 


O"0x 
Q(t + At) = gy(t — At) + alk (t). 


The values of ¢,(¢ + At) are then found by solving the finite-difference analog of 
the system (4), 
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(As) *(pit Ljk sis Pi Ljk “ Pijt+ik = dij | 1 jx) + p> A dijr = Jill + At) 


st 
in which the subscripts 7 and j denote quantities at « = iAs, y = jAs. The solu- 
tion of this equation may conveniently be obtained by the iterative scheme 
(v + 1) (y) 9 ; j (v) (» + I) (v) 
Pik = oi: + A[4 — (As)*Bix] "1 OF 4 1328 +O 1gje + O53 412 4+ 


( 
f 


n 


O) Le — 4OuR + (as)? | D Anos — qurlt + 4 | 


p= j 


Geel... f—-h 7 =1,...,6—- bb = 1,254); 

(y»=0,1,2,...; € = vforr > k,e = v+1forr < k) 
applied to the points 7jk in lexicographic order with &k first, 7 second, and 27 last.* 
It may be shown that the sequence { oi} will converge whatever the initial guess 
,;. providing only that the matrix (A,,) is negative definite and ) is appropriately 
chosen. 

A 24-hour prediction carried out on the LAS electronic computer in 48 half-hour 
time steps At for a 16 X 16 grid required 48 minutes of computing time. During 
this time the machine performed approximately 750,000 multiplications and divi- 
sions, 10,000,000 additions and subtractions, and executed 30,000,000 distinct 
orders. The space interval As used for all predictions was 300 km. 

4. Presentation of Results —At 12:30 Z (Greenwich Mean Time), November 24, 
1950, a small depression was observed to form on a cold front extending along the 
Atlantic seaboard of the United States and to develop within 24 hours into one 
of the worst storms ever recorded in the eastern United States. The surface 
weather sequence is shown in figures | (a, b, c,d). Because of the spontaneity and 
intensity of its development this storm was selected as an ideal test case for the 
prediction of cyclogenesis. The previous integrations with £, and E, had been 
made for the six overlapping 24-hour periods 11/23/03Z-11/24/03Z, 11/23/15Z- 
11/24/15Z, 11/24/03Z-11/25/03Z, 11/24/15Z-11/25/15Z, 11/25/03Z-11/26/03Z, 
and 11/25/15Z-11/26/15Z. The model FE, gave predictions at 500 mb, and the 
model #2 predictions at 300 and 700 mb. The predictions with £; were made for 
the same periods and at first for the levels 200, 500, and 850 mb. 

It is not possible to say with precision whether or not cyclogenesis was predicted 
with the autobarotropic model; this model yields only a 500 mb forecast, whereas 
most of the development, in the sense of increase in cyclonic vorticity and kinetic 
energy, took place at lower levels. It may be seen from figure 5, however, that the 
changes that did oecur at 500 mb during the 24 hours following 15Z, November 
24, were poorly predicted. 

The charts in figure 2 show that a deepening did actually occur at 700 mb but 
was not predicted by the two-level model, which instead predicted a filling. How- 
ever, it should be remarked that a rudimentary deepening was predicted in another 
place, just off the coast of the United States between 35° and 40° latitude, and that 
this may perhaps indicate that the mechanism of cyclogenesis is inherent, although 
in a crude form, even in the two-level model. 

The actual motion at 850 mb resembled the surface motion; a new low center 
formed to the southwest of the original center after 12 hours and in the succeeding 
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FIGURE 1 


(a) Surface weather chart for 12:30 Z November 24, 1950. Shading indicates areas of active 
precipitation. (b) Surface weather chart for 00:30 Z November 25, 1950. Small squares con- 
nected by arrows indicate past positions of main low center at 12-hour intervals. (¢) Surface 
weather chart for 12:30 Z November 25, 1950. (d) Surface weather chart for 00:30 Z November 
26, 1950. 
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FIGURE 2 


(a) Observed 700 mb height contours at 15 Z November 24, 1950. The contours are labeled 
as deviations from the standard height of 9879 ft. in units of 10 feet. (b) Two-level 700 
mb prediction for 03 Z November 25. (c) Two-level 700 mb prediction for 15 Z November 25. 
(d) Observed 700 mb chart for 15 Z November 25. The small circles connected by solid lines 
indicate the successive positions of the observed low center, and those connected by dashed lines 
the predicted positions. The height difference at the center is printed above, and the time below, 
each circle. 
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FIGURE 3 
(a) Observed 850 mb height contours at 15 Z November 24, 1950. The contours are labeled 
as deviations from the standard height of 4779 ft. in units of 10 feet. (6) Two-level 850 mb 
prediction for 03 Z November 25. (¢) Three-level 850 mb prediction for 15 Z November 25. (d) 
Observed 850 mb chart for 15 Z November 25. See legend under figure 2 for explanation of circles 
and connecting lines. 
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12 hours grew to such an extent that it dominated the entire flow over the eastern 
United States. It will be seen from figure 3 that a good part of this development 
was predicted by the three-level model, although as far as could be ascertained no 
separate low center was predicted to form. 
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FIGURE 5 
(a) Observed 500 mb height contours at 15 Z November 24, 1950. The contours are labeled 
as deviations from the standard height of 18281 ft. in units of 10 feet. (b) One-level (auto- 
barotropic) 500 mb prediction for 15 Z November 25. (¢) Three-level 500 mb prediction for 15 Z 
November 25. (d) Observed 500 mb chart for 15 Z November 25. See legend under figure 2 
for explanation of circles and connecting lines. The upper dashed line is the path of the low center 
in the one-level model, the lower dashed line is the path in the three-level model. 


The failure to predict a new center can be attributed in part to the location of the 
forecast levels in the three-level model. The 850 and 500 mb geopotentials do not 
adequately describe the motion between the ground and say 700 mb, yet it was 
here that the most intense part of the development took place. An additional 
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computation was therefore made with 
700, and 400 mb levels. 
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The results 
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a three-level model defined for the 900, 
are shown in figure 4. 


It will be seen 


that a new center was predicted to form and to deepen by very nearly the 
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FIGURE 6 


Correlation coefficient r between predicted and observed 
24-hour height changes plotted as a function of the initial 


time, for the period November 23-25, 1950. 


The letters 


A and P denote the times 03 Z and 15 Z, respectively. 
The dashed line gives r for the one-level prediction at 500 
mb, the dotted line gives r for the two-level prediction at 
700 mb, and the continuous line gives r for the three-level 
prediction at 500 mb. 


correct amount. The path of 
the predicted center (dashed 
trajectory in figure 4f) paral- 
leled the observed path (solid 
trajectory) but remained some 
240 miles too far north. 

The discrepancy in the place- 
ment of the low center is be- 
lieved to be associated with 
the spurious intensification of 
the anticyclone to the south- 
west. Since recalculation with 
shifted boundaries indicated 
that the intensification was not 
a boundary effect, and meas- 
urements showed impossibly 
large anticyclonic vorticities, 
it was decided that the inten- 
sification was probably due to 
the following mathematical 
simplification made in the 
model equations, a simplifica- 
tion that destroys the mecha- 
nism preventing the unlimited 
growth of anticyclonic vortic- 
ity: If nis thesum f + ¢ of 
the vertical components of the 
earth’s vorticity f and the rela- 


tive vorticity ¢, 6 is the potential temperature, and d/dt is the particle derivative, 
the basic quasi-geostrophic equation of motion may be written? 


or 


d 06\ 7 
dt\"ap} 
1 dy 06\~'d 06 | 
n dt Op} dt Op rm 


It follows immediately that if 7 is once positive at a particle it cannot subsequently 


become negative, i.e., ¢ cannot become smaller than —f. 


But if, as in the deriva- 


tion of equations (3) and (4), the denominators n and 06/Op are replaced by their 
respective mean values at the levels p,, one no longer restricts the algebraic value 
of n, which may then become negative or impossibly small. 

That the three-level 200-500-850 mb prediction considerably bettered the one- 


level, or autobarotropic, prediction can be seen from figure 5. 


This figure compares 
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the 500 mb 24-hour predicted charts from the initial time 1500Z November 24, 
with the observed chart for the same time. The position and orientation of the low 
as well as of the down-stream wedge are seen to be much more accurate in the 
three-level 500 mb forecast. 

An over-all comparison of the one-, two-, and three-level 24-hour predictions 
was made by computing the correlation coefficient, r, between the predicted and 
observed 24-hour changes for each of the models. The coefficients were computed 
for the one-level predictions at 500 mb, the two-level predictions at 700 mb, and 
the three-level predictions at 500 mb. The graphs in figure 6 show the respective 
coefficients plotted against the initial times. It will be observed that the level of 
accuracy for all three predictions was high during the period preceding the storm, 
i.e., until 15Z November 24, but remained high only in the case of the three-level 
predictions. 

5. Conclusions.—Following the predictions of the November 1950 storm a 
second series was carried out with the three-level model for a more typical, though 
less intense, development on the east coast of the United States. Again the cyclo- 
genesis was predicted. With this result as additional evidence the following 
tentative conclusions may be drawn: 

(1) The inception and development of cyclones are determinate and _ pre- 
dictable events, consequently the conception of cyclogenesis as the growth of an 
effectively random perturbation in an unstable uniform field of flow must be aban- 
doned. If the notion of hydrodynamic instability is to be retained at all, some 
such interpretation as the following would seem to fit the facts more closely: Since 
the basic flow is not steady and linear but transient, curved, and with variable cur- 
vature, the initial perturbation will have a preferred location in space and time, and 
its amplitude, though it may be small initially, will be entirely determined by the 
basic flow. It is like an automobile being pushed slowly but inexorably over a 
cliff. 

(2) Since the models used were quasi-geostrophic, the geostrophic approxi- 
mation is evidently not incompatible with the occurrence of cyclogenesis. Hence 
motions which are essentially non-geostrophic need not be assumed for the ex- 
planation of cyclogenesis. Thus, the Helmholtz-Solberg-Héiland type of insta- 
bility that occurs under certain conditions for axially-symmetric displacements 
of the fluid in a cylindrical vortex, or constant transverse displacements in a quasi- 
linear flow—displacements in which angular and linear momentum respectively 
are conserved—decidedly does not appear responsible for cyclogenesis. 

(3) Since the models contained no sources or sinks of heat we infer that 
these are not essential factors in the occurrence of extratropical cyclogenesis 
in at least a wide class of motions. This is not to say that they play no role at all, 
but that they do not play a decisive role. 

(4) The evidence drawn from the two-level predictions indicates that two 
parameters along the vertical do not sufficiently define the motion for an accu- 
rate prediction of cyclogenesis. It appears that one must somehow take into 
account the low-level thermal asymmetries, and for this one needs at least a three- 
parameter model. 


Acknowledgment.—The author wishes to thank Dr. Bruce Gilchrist who, assisted 
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ARITHMETIC GENERA AND THE THEOREM OF RIEMANN-ROCH 
FOR ALGEBRAIC VARIETIES 


By Frieprich HirzEBRUCH 
THE INSTITUTE FOR ADVANCED STUDY 
Communicated by S. Lefschetz, December 21, 1958 


Introduction.—In a preceding note! we posed the problem whether the arithmetic 
genus II(V,,) of a (non-singular) algebraic variety? coincides with the Todd genus 
T(V,). The purpose of the present note is to prove that this is actually the case. 
Moreover we prove a main theorem (./7) which gives a formula for the Euler- 
Poincaré characteristic of V, with respect to the cohomology® of V, with coefficients 
in the sheaf (faisceau) of local holomorphic cross-sections of any complex analytic 
bundle W over V,, which has the complex vector space C, as fibre and the linear 
group GL(q, C) as structure group. The main theorem expresses this Euler- 
Poincaré characteristic as a polynomial in the Chern classes of the tangential bundle 
of V,, and in the Chern classes of the bundle W. As special cases one gets the Todd 
formula I(V,,) = T(V,) and the “Theorem of Riemann-Roch for arbitrary dimen- 
sions.” The author wishes to extend his hearty thanks to Professors A. Borel, K. 
Kodaira, and D. C. Spencer, with all of whom he had many valuable discussions. 
The notes of Kodaira and Spencer‘ are essentially used. The author also wants to 
point out that the main theorem (in a slightly different formulation) was conjec- 
tured by J. P. Serre in a letter to Kodaira and Spencer. The proof of the main 
theorem uses the index theorem of the author! which involves essentially the theory 
of “cobordisme” due to R. Thom. Full details of the proof of the main theorem will 
appear elsewhere. 

1. The Main Theorem and Some of Its Consequences.—Let V, be an algebraic 
variety (non-singularly imbedded in some complex projective space). Let W be a 
complex analytic bundle over V, with the complex vector space C, as fibre and the 
linear group GL(q, C) as structure group. Denote by ¢; the Chern classes of the 
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tangential bundle of V, and by d; the Chern classes of the bundle W.  (¢,; « H?*(V,, 
Z),d;eH?(V,,Z),0 SiS n,0 Sj Sq, ce = dy) = 1.) We introduce the formal 
roots yi, 9): 


n 


n q q 
> ca = II (1 + y,), > dja’ = IL (1 + 6,2). 
i=0 , i=1 j=0 j=1 
Every formal power series which is symmetric in the y; as well as in the 6; will be 
considered as a power series in the Chern classes c; and d; and hence as an element of 
the cohomology ring of V,. Denoting by Q the rationals we define the operator x, 
on the cohomology ring H*(V,, Q) as follows: For every ue H*(V,, Q) we take the 
component of topological dimension 2n which we consider in the unique fashion as a 
rational number. This number is denoted by «,[w]. 

Main THEeoreM (M). Using the above notations we put 

n 
x(V,, W) = > (—1)' dim H*(V,, W), 
i=0 

where H'(V,,, W) is the 1-dimensional cohomology group of V,, with coefficients in the 
sheaf (faisceau) of local holomorphic cross-sections of W. We have 


1/2 as f2 
Kn je” (ch +e +... teh) T — sh 


;-18Inh y,/2 


Remark: x/sinh x is a power series in x”. Since the y; can be regarded as the 
formal roots of the Pontrjagin polynomial! of V,, we see that x(V,, W) is a poly- 
nomial in ¢,, the Pontrjagin classes of V,, and the Chern classes of the bundle W. 

Now let W be the trivial bundle over V, with C; as fibre. The main theorem 
implies? 

THEOREM |. The Todd genus of an algebraic variety is equal to the arithmetic 
genus: 


n 
N(V,) = d) (-1)'9; = T(V;). 
i=0 
More generally, let F be an arbitrary complex line bundle (fibre C,) over V,. 
Denote by f = c(F) its characteristic class (f « H*(V,, Z)). The main theorem 
implies 
THEOREM 2. We have 


x(V,, F) = vf ft Il = | 


:-1 sinh y,/2 


Hence x(V», F) is a polynomial in f + c,/2 and the Pontrjagin classes of V ,. 
Theorem 2 can be considered as a generalization of the theorem of Riemann- 
Roch to arbitrary dimensions. It contains the known Riemann-Roch theorems for 
n= 1,2,3. In the case n = 2 the term dim H'(V,, F) has to be identified with 
the superabundance of F. A similar remark applies to dim H}(V,, F) and dim 
H*(V,, F) in the case n = 3. In general, the term dim H"(V,, F) is equal to 
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dim H"(V,, K — F), where K is the canonical bundle of V,.. The characteristic 
class of K is equal to —c. Kodaira® proved that H'(V,, F) = 0 for i > 0, if the 
characteristic cohomology class of the bundle F — K “contains” a closed positive 
definite Hermitian form. In this case F — K is called ample. An arbitrary divisor 
D of V,, represents a complex line bundle,’ which we also denote by D. Obviously 
dim |\D| + 1 = dim H°%(V,, D). Denoting the cohomology class of D by d we get 
THEOREM 2*. /f D — K is ample (in the sense of Kodaira), then we have 


n /9 
dim |D| + 1 = x, iaes ie — i 
i=1Sinh y,/2 

(For n = 1, this is the classical formula: dim |D| = d — p, where c, = 2 — 2p.) 

Applying (M) to the case where W is the bundle of complex covariant p-vectors, 
yields® 

THEOREM 3. Let h’'* be the number of harmonic forms of type (p,q) on V,,. We 
have 


(V,) =x(Vi,W) = YS (-Vh = xy | (se Sing : n | 


q=0 im1e %—] 


: n — 
The sum on the right side has to be extended over all the ( possible combinations. 
p 


We attach to every algebraic variety the polynomial 


x(V.) = D> x*(Vi)y? 


p=0 


and get by easy calculations 


THEOREM 3*. We have 
n 
x(V,) = cof I] iro | 
i=1 


where 
Q(x) = = Tee — ya. 
Theorem 3* is for y = 0a restatement of Theorem 1. For y = —1 it gives that 
x-(V,) = pa cag” 
>,q=90 


is equal to the Kuler-Poincaré characteristic c, of V,. For y = 1 we obtain 
THEOREM 4. We have 


xn(V,)= ¥ (<1 = of > > |. 


p,q=90 i=1 tgh Vi 


Theorem 4 is known even for Kahler manifolds, because the left side of the equa- 
tion is equal to the index r(V,).8 The right side is also equal to 7(V,,) by the index 
theorem of the author.' The main theorem admits many further applications 
which we do not mention in this note. 
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2. A Sketch of the Proof of the Main Theorem.—In section | we derived the 
Theorems |—4 from the main theorem (M). Actually the proof of (M) goes almost 
in the opposite direction and involves the following steps. 

2.1. Proof of Theorem 1 by Means of the Known Theorem 4.—A complex manifold 
M,, is called a split manifold, if the structure group GL(m, C) of its tangential bundle 
can be reduced complex analytically to the group A(m, C) of all triangular matrices 
(all entries above the diagonal are 0). Let V, be an arbitrary (non-singular) alge- 
braic variety, the tangential bundle has GL(n, C) as structure group. Hence we 
can construct the associated bundle with GL(n, C)/A(n, C) as fibre. We obtain 
in this way an algebraic variety V>, of dimension m = n + '/on(n — 1). The fact 
that V,, is algebraic can be proved, for example, by using the general theorem of 
Kodaira® that every complex projective bundle over an algebraic variety is an 
algebraic variety. It can be proved that V,, is a split manifold. It is almost 
obvious that I1(V;,) = II(V,,). On the other hand, it follows from Theorem 4.2 
in the preceding note! that 7(V;,) = T(V,,). Hence it suffices to prove Theorem 1 
for all algebraic split varieties. Now let V;, be an arbitrary algebraic split variety. 
Then we have over V,,, an increasing sequence of bundles W; (in the sense of section 
1), where W, has C; as fibre and where W,, is the tangential bundle of V, Os 
1 < m). We denote the complex line bundle W,/W;_1 by A; (1 S i S m) and its 
cohomology class by a;. We use the notation of the preceding note and obtain 
readily the formula'® 


2”"T(V;,) = >? a ee (1) 


j=0 


rome ; m sep hes 
rhe inner sum has to be extended always over all the possible ( . ) combinations. 
J 


By the four-term formula of Kodaira-Spencer!! we define in the faisceau theory 
virtual indices x;._ This can be done for all algebraic varieties. (It can be done 
not only for x, but even for x,.) We know that x, = 7 for all non-singular alge- 
braic varieties, (Theorem 4). Since x; and 7 both fulfill the “functional equation” 
3.3 of the preceding note,! we get by an induction argument on the dimension that 
xi and 7 also coincide in the virtual case.” 

By some rather complicated calculations based on exact sequences of sheaves we 
can prove the following formula which corresponds to (1) 


2”"1(Vin) = Ex x1(A,,°A,,°. ..0A,,) (2) 
F 
Here the terms x; denote virtual indices.'* Formulae (1) and (2) imply Theorem | 
for split varieties, and hence Theorem | is proved in general. 

2.2. Proof of Theorem 2.—The virtual II and the virtual 7 both fulfill the 
“functional equation” 4.3 of the preceding note. Hence the usual induction argu- 
ment already used for x; in 2.1 proves that IJ and 7’ also coincide in the virtual case. 
This proves Theorem 2 (i.e., the main theorem for g = 1). Moreover, the main 
theorem follows immediately for all cases in which the bundle W admits the tri- 
angular group A(q, C) as structure group. 

2.3. Proof of the Main Theorem.—We use the notation of the first section. We 
construct over V, the bundle V;, with fibre GL(q, C)/A(q, C) which is associated 
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with W. The manifold V;, is, according to Kodaira’s theorem,’ algebraic (m = 
n+ '/og(q — 1)). We lift the bundle W up to V>, and call the lifted bundle W*. 
The bundle W* splits, i.e., admits A(q, C) as structure group. Hence Theorem 2 
implies that x(V;,, W*) is given correctly by the formula of the main theorem. A 
spectral sequence argument shows that x(V;,, W*) = x(V,, W). On the other 
hand, we can prove by a slight generalization of the Theorem 4.2 of the preceding 
note! that the formula of the main theorem gives for x(V»,, W*) and x(V,,, W) the 
same values. This concludes the proof of the main theorem. 

Remark.—The formulae (1) and (2) are special cases of more general formulae 
valid for x, with y regarded as indeterminate. The “functional equations” for 
Il = xo = T and for x; = 7 are special cases of a “functional equation” valid for x,. 


1 These PRocEEDINGS, 39, 951-956 (1953). We use the notations of the preceding note. 

2 Complex dimensions are indicated by a subscript. 

3 All the cohomology groups occurring in this note have finite dimensions. See Cartan, H., 
and Serre, J. P., Compt. rend. acad. sci., Paris, 237, 128-130 (1953), and Kodaira, K., these Pro- 
CEEDINGS, 39, 865-868 (1953). 

4 Kodaira, K., and Spencer, D. C., these ProcreEpiNGs, 39, 641-649, 868-877 (1953). K. 
Kodaira, loc. cit. in ref. 3. 

5 Here we use the theorem of Dolbeault (Compt. rend. acad. sci., Paris, 236, 175-177 (1953)) 
which states that H"(V,, W™) = H” *, where W™ is the bundle of covariant p-vectors and where 
H”: ‘is the linear space of all harmonic forms of type (p, g) on Vn. 

6 Kodaira, K., “On a Differential-Geometric Method in the Theory of Analytic Stacks,” these 
PROCEEDINGS, 39, 1268-1273 (1953). 

7 Conversely every complex line bundle can be represented by a divisor. See Kodaira, K., and 
Spencer, D. C., loc. cit. in ref. 4, p. 874. 

8 Hodge, W. V. D., Proc. Int. Congr. Math., I, 182-192 (1952), and Proc. Lond. Math. Soc. (3), 
1, 104-117 (1951). 

9 Kodaira, K., not yet published, 

10 The sum on the right side starts for 7 = 0 with 7(V*). The formula (1) was used by the 
author to prove Theorem 4.1 of the preceding note (loc. ct. in ref. 1). See also Hirzebruch, F., 
“The Index of an Oriented Manifold and the Todd Genus of an Almost Complex Manifold,” 
Notes, Princeton University, 1953 (mimeographed ). 

1 Kodaira, K., and Spencer, D. C., ““On a Theorem of Lefschetz and the Lemma of Enriques- 
Severi-Zariski,”’ these PROCEEDINGS, 39, 1273-1278 (1953), formula (14). 

12 The fact that x: fulfills the functional equation can be obtained from the four-term formula 
by some calculations. 

18 The small circle o denotes “virtual intersection.” 
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ON THE CONTINUOUS GOLD-MINING EQUATION 
By RicHARD BELLMAN AND SHERMAN LEHMAN 
THE RAND CORPORATION AND STANFORD UNIVERSITY 


Communicated by J. von Neumann, December 30, 1953 


$1. Introduction.—In some previous communications’ ? we have described 
some results obtained in the investigation of a dynamic programming problem, the 
“gold-mining”’ problem, which led to the functional equation 


N 
A: Dpi(ria + f( — rida, y) 

f(a, y) = Max 'v , w%y20, (1D 
B: Yadsy + f(a, (1 — sidy) 


t=1 


where pi, gi > 0, >opi, ogi < 1. The solution of this equation was given, inter alia, 
t t 

in ref. 1, and shown to have a relatively simple form. In addition, a partial solution 

of a more complicated equation, corresponding to a non-linear utility function, was 

given in ref. 2, having the same form. It is known, however, as a result of an un- 

published counter-example due to H. N. Shapiro and 8. Karlin, that the solution of 

more general equations such as 


A: pi(nx + f((1 — n)a, y) 
f(a, y) = Max “ (rey + f(x, (1 — re)y) (1.2) 
: pore + ray + f((1 — rs)a, (L — ray) 


0 <1, 72, 73, 71 < 1,0 < pi, G1, po < 1 cannot have the same simple form for all 
values of the parameters. 

In an effort to gain some insight into the structure of particular classes of solutions 
of (1.2), and similar equations of more complicated type, we have been led to con- 
sider some continuous analogues of these equations. There are many different 
procedures for obtaining these continuous analogues. One which we have followed 
to begin with leads to problems in the calculus of variations. 

An essential feature of our research lies in viewing a policy in its extensive 
rather than normal form, to borrow the terminology of game theory. Another way 
of stating this is that instead of determining the complete solution for one set of 
initial parameters, which would correspond to determining the extremal curve in 
the classical theory of the calculus of variations, we attack our problem by im- 
bedding it into the family of problems of this type with arbitrary initial parameters. 
This is the approach used throughout the theory of dynamic programming.!~* 
Having done this we determine an optimal continuation from each position, which 
upon being carried through yields an optimal policy. 

This approach, which may be considered a variant in problems of deterministic 
type, is in many ways a necessity in problems of stochastic type. It is possible to 
treat many of the classical problems in the calculus of variations by means of this 
technique. We shall return to this point at some future time. 

Guided by our knowledge of the solution in the discrete cases, and using the be- 
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havioristic approach described above, we have been able to solve completely and 
explicitly a variety of problems which are intractable in the original discrete form. 

In the following sections we shall discuss the simplest counterpart of (1.2) in 
continuous form, and list a number of typical results we have obtained. Following 
this we shall sketch briefly a formulation of the more general continuous version 
which results from processes corresponding to (1.1) and which requires more power- 
ful techniques. 

A more complete discussion and proofs of the results contained herein will appear 
elsewhere. Further results concerning more general problems discussed in ref. 3 
will be presented subsequently. 

$2. Formulation.—In the formulation of problems involving the use of continu- 
ous policies we are immediately faced with the difficulty of defining what we mean 
by a continuous mixed strategy, and of constructing the appropriate mathematical 
theory with which to handle this thorny concept. To circumvent these conceptual 
and mathematical difficulties, we shall utilize an idea emphasized in ref. 6, which— 
briefly put—is that for mathematical purposes, mixing at a point is to an arbitrary 
degree of approximation equivalent to mixing pure strategies in an interval about 
the point. 

Let us then consider a process where we are given two initial quantities, x and y, 
the gold mines of ref. 2; and two operations, A and B, mining operations. If A 
is used over a time interval 6, there is a probability 1 — qié + 0(6) that rivé + 0(6) 
is obtained and that the process is allowed to continue, with the new initial amounts 
x — rxv6 + 0(6), y; and a probability 9:6 + 0(6) that nothing is obtained and the 
process terminates. In a like manner, if B is used, there is a probability 1 — 26 + 
o0(6) that royé + 0(6) is obtained and the process continues; and a probability q26 + 
o(6) that the process terminates. 

To introduce the concept of mixing, we consider first the case in which the time 
interval is divided into intervals of length A, where Ais small. In a typical interval 
[t,t + Al, t = kA, the first part of the interval, [t, ¢ + A], will be devoted to the 
use of A; while the second part, [t + @A, ¢ + A], will be devoted to the use of B. 
In the limit, as A —~ 0, we obtain the effect of mixing A and B at ¢ in the ratio 
¢o::(1 — @)), ef. ref. 6 for further discussion. 

A strategy consists of a choice of ¢ for each of the points kA. We wish to de- 
termine the strategy which will maximize the expected value of the amount obtained 
before the process terminates. For any given strategy let 


v(t) = quantity of gold remaining in first mine provided that the process 
has continued until ¢, 

y(t) = quantity of gold remaining in second mine provided that the 
process has continued until ¢, 

p(t) = probability that the process continues at least to ¢, 

f(t) = expected amount obtained up to ¢. (2.1) 


Writing down the equations expressing x(t + A), y(é + A), p(t + A), f(t + A) in 
terms of the values at tf, and letting A > 0, we are led to the following system of 
differential equations: 
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Ix 
=. —¢ilb)riv(8), zal = 2, 
dt 
dy 

= —¢(trey(), y(o) = Yo, 
dt 
dp . 
rr —p(t)loilOqa + d2(b¢el, p(o) = | 
¢ 
if ‘ 
- = pii[a(Ona(t) + gry], flo) = 0, (2.2) 
( 


where 0 < gi < 1, g@ = 1 — g;. The problem is now to determine ¢,(/) so as to 
maximize f(). It is not difficult to give a proof based upon, say, weak conver- 
gence, which will assure us that the maximum is actually attained. As W. Fleming 
has kindly informed us, the existence of a maximum is guaranteed by a general 
theorem in the calculus of variations. 

Since the equations are fortunately non-linear, variational techniques are par- 
ticularly applicable. We find 

THEOREM 1. The maximum value of f( ©) is attained by the policy 


(a) If qnx > rey, dr = 1, 
(b) If mrey > gqrir, d = 1, 
(c) If qarix = grey, dr = 12/(r1 + 12), 62 = 1/ (11 + 12). (2:3) 


Note that the boundary line is, as might be expected, the set of points where 
expected gain over expected cost is the same for both choices, A and B. 
THEOREM 2. I[f T is finite, the optimal policy has one of the following six forms: 


(a) A always (d) A, then M, then A 
(b) B always (e) B, then M, then A 
(c) M followed by A (f) B followed by A. (2.4) 


This is for q, < q2; @ similar result holds for gz <q. The precise intervals within 
which each is used may be determined explicitly. Here M represents the choice given 
in (2.3¢). 

The optimal strategy represents a compromise between the long-term policy 
given in Theorem | and the short-term policy of maximizing expected gain. 

The 3-choice problem corresponding to (1.2) has the continuous analogue: 


dx 

dt oo [p(n + o3(t)rs|a(b), 

dy 

V1  le2(Ore + da(Oraly(O, 

dp 

dt —POloOn + gaz + ds(qs], 

if | 
2 = p(t) [(di(ri + bs(b)rs)a(Q) + (do(Hr2 + os(Ora)y(O |, fo) = 0, (2.5) 


Where 0 < gi, $2, os S 1, d1 + 2 + oO = 1. 


The maximum value of f(~) is provided in the general case by the policy repre- 
e J t=) . © 
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sented schematically by 








Depending upon the values of the parameters, one line is an absorbing barrier, 
which is to say, for (x, y) on the line a mixed policy is pursued which keeps the 
point on the line. This line has as its equation the equality of expected gain over 
expected cost. The other line will be a translucent barrier, causing only a change 
from ¢; = 1 to ¢; = 1, and is not defined by an equality of the above type. In 
special cases the middle region disappears and L, coincides with Ly. The solution is 
now that for the two-choice case. 

This last result is quite surprising and explains some of the difficulties of the dis- 
crete problem. One boundary, the absorbing barrier, is determined by a local con- 
dition, whereas the other is determined by a global condition. 

THEOREM 3. [f, in the two-choice problem described by (2.2), in place of eapected 
return, we seek to maximize the expected value of some function @ of the total return 
where is any strictly increasing function, the solution is that given by (2.3). 

To obtain this result we consider 


C= — Sf" o(x0 + fo — x(t) a y(t))dp(t), (2.7) 


the quantities being defined as in (2). 

The proofs of the above results are long and detailed, depending upon a precise 
analysis of the properties of an optimal policy. 

§3. More General Processes—Let us now consider the more general process 
corresponding to (1.1). Here the use of A leads to a variety of possible gains, and 
similarly for B. The quantities x(t) and y(t), as defined by (2.1) are now stochastic 
quantities. This means that it is no longer possible to obtain the equations of (2.2). 
Instead we must introduce the function F'(u, v, t) defined by the property that 


Priu < x(t) <ut+duv < yt) <vot+ dv} = F(u, v, t\dudv. (3.1) 


We may now, in a way similar to that followed in §2, derive a partial differential 
equation for F of the form 


oF oF oF 
= P(u,v _——. 3.2 
ot (u r + Q(u ”)> (3.2) 
The system of ordinary differential equations 
| ly 
> = P(u, v), . = Q(u, v) (3.3) 


connected with (3.2) will have a form similar to the first two equations in (2.2). 

The differential equations we have used to define our continuous processes bear 
the same relation to the rigorous integral equations defined by the original processes 
as the heat equation bears to the Chapman-Kolmogoroff equations. 
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Finally, let us note that the above formalism is also applicable to two-person 
multi-stage games of continuous type, and, in particular, to pursuit games. 

These extensions will be discussed in subsequent communications. 

! Bellman, R., “On the Theory of Dynamic Programming,’’ Proc. Natu. AcAb. Sct., 38, 716-719 
(1952). 

2 Bellman, R., “‘Sormhe Functional Equations in the Theory of Dynamic Programming,” /bid., 
39, 1077-1082 (1953). 

3 Bellman, R., ‘‘Bottleneck Problems and Dynamic Programming,’’ [bid., 39, 947-951 (1953). 

‘ Bellman, R., “A Problem in the Theory of Dynamic Programming,’ Econometrica (to appear). 

5 Bellman, R., “(Computational Problems in the Theory of Dynamic Programming,’ Proc. 
Symposium on Numerical Analysis, Santa Monica, 1953. 

6 Bellman, R., and Blackwell, D., “Some Two-Person Games Involving Bluffing,’’ Proc. Natt. 
Acap. Scr., 35, 600-605 (1949). 


ON A PARTICULAR TYPE OF CONVERGENCE TO A SINGULAR 
MATRIX 


By E. INonu Anp E. P. WIGNER 
PRINCETON UNIVERSITY 
Communicated December 14, 1953 


It was attempted, in an earlier investigation,' to consider limiting cases of con- 
tinuous groups G which are themselves groups. For this purpose, linear com- 
binations of the infinitesimal elements J of G were formed 


Ji = > (uip + ew;,)1, (1) 


in which wu was singular but the determinant ‘u + ew} did not vanish identically 
ine. It was attempted then to find the conditions for the J to remain infinitesimal 
operators of a group even in the limiting case «> 0. The group generated by the J 
in the limiting case was called a contraction of G. 

It was stated, in the course of the derivation of the conditions for the contrac- 
tion, that it is always possible to find two non-singular matrices 6 and a such that 

1 Ol e v Ol 
Bua" = 0 0 Bwa = 0 1° (2) 

In (2), the 1, 0, v are submatrices: the rank of the | in Bua~! is equal to that of w. 
This statement is, in general, incorrect? as can be seen already on the example of the 
two dimensional matrices 


When checking the calculation which led to the form (2), it was found that it 
contained a continuity assumption which appears reasonable but which was not 
stated explicitly. Since the continuity assumption in question may be of interest 
also apart from the problem of the contraction of groups, we shall formulate it be- 
low. 
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Let us consider two n dimensional vector spaces J and J and mappings U(e) of / 
onJ. For0< ¢€< &, the mapping shall be one-to-one; U(0) shall map J on a sub- 
space J, of J. We shall call the mapping U(e) reversibly continuous at « = 0 if 
U(0)U(«)~', applied to the vectors of /,, converges to the identical mapping of //,, 
on itself as « ~ 0. An equivalent formulation hereto is that the mapping U(0) 
U(e)~'U(0) of J unto J shall converge, as e— 0, to U(0). In fact, let 7 be a vector 
in J and write 7 = U(0)i. The convergence of U(0) U(e)~') ~ 7 is synonymous 
with the convergence of U(0) U(e)~'U(0)i to U(0)2. 

{f the convergence of u + ew to u is reversibly continuous, the existence of the 
normal forms (2) does indeed follow. Thus the addition of this assumption will 
guarantee the validity of the aforementioned considerations.! One can see this perhaps 
most simply as follows. Let us denote by J) the subspace of J for which uly) = 0. 
Similarly, let us denote the subspaces uJ and wl) by J, and Jo... We can first prove 
then that J, and Jo, together span the whole space J. In fact, if the vector 7 is in 
Io, wip = 0 would entail (wu + ew)io = 0 for all ¢ which is contrary to assumption. 
Furthermore, w7) cannot be contained in J, because if it were 


u(u + ew)—'wi) = (1/e)u(u + ew) "(u + ew)ty = (1/e)uty = 0 


would not converge to wip, contrary to the postulate of reversibly continuous con- 
vergence. Hence, w/) contains as many linearly independent vectors as J) and 
these are also linearly independent of the vectors in uJ. It follows that ul = J, 
together with w/, span the whole space J. If we choose, therefore, a coordinate 
system in J so that some of its axes 7, span Jo, the others 7, the rest of 7, we shall 
have 


Ulo, = 0 Wl, = Jox ui = Jur (3) 


The last two of these are the definitions of jo, and j,, which, together, span J. We 
can set, therefore 


Whe = Dard un + eeandon- (3a) 
X Xr 
Hence if we write 
luz = lie — > Zaston (3b) 
the 7,, and 7%, will span J and we have, in addition to (3) which remain valid 
Ulux = ape Wlux = DP rJua- (3¢) 
r 
Hence u and w will in fact have the form (2) if we choose in J the coordinate axes 7,,;, 
do, and in J the axes juz, jor: 
In matrix language, one can formulate our theorem as follows. Be wu and w 


n-dimensional square matrices and the rank r of u be smaller than n while u + ew be 
non-singular except for isolated «. Then the necessary and sufficient condition for 


u(u + ew)~'u—>u as e—> 0 (4) 


is that the coefficient of e’~’ in the determinant |w + ew| shall not vanish. In 
order to prove this, we first note that (4) remains invariant if one substitutes Bua 
for u and Bwa~'! for w where a and @ are non-singular and independent of «. Ac- 
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cording to our Theorem, if (4) is fulfilled, 8 and @ can be so chosen that (2) be ful- 
filled. This form shows, however, that the lowest term in the determinant |Bua™! 
+ Bwa i if considered as a polynomial in ¢, is e"~”’. Hence, the lowest term in the 
determinant |w + ew| is |6|—!|a\e"~’, i.e., the coefficient of ¢"~’ in the determinant 
u + ew} does not vanish. 

Conversely, let us assume that wu and w are of dimension n, u of rank r, and the 
coefficient of e”~’ in |w + ew| does not vanish. The same holds then for u’ = Bua! 
and w’ = Bwa~' and we can choose 8 and a so that u’ = Bua have the form given 
in (2). Furthermore, the algebraic complement of any element in the first r rows 
’. Hence, the ratio of such a complement to 
1 








of lu’ a ew’ | goes to zero at least as e”” 
the determinant |w’ + ew’|, i.e., any element in the first r rows of (u’ + ew’) 
converges to a finite number (or to 0) as e— 0. It follows that the elements in the 
first 7 rows of (u’ + ew’)~!u’ converge to the same numbers to which the corre- 
sponding elements of (w’ + ew’)~(u’ + ew’) converge. Hence, as far as the first 
r rows are concerned 
wu’ (ul + ew’) ul > u'(u! + ew’) '(u’ + ew’) = w’. (4a) 

However (4a) is clearly correct also as far as the last n — r rows are concerned be- 
cause these vanish on both sides because of the form (2) of u’ = Bua-'. Hence, 
(4a) follows from the premises and since a and 8 are independent of «, this holds 
also for (4). 


1 Tnonu, E., and Wigner, E. P., these ProcEEDINGs, 39, 510 (1953). 
* We are much indebted to Dr. V. Bargmann for his friendly comments concerning this point. 


ON NON-NEGATIVE-VALUED MATRICES* 
By Y. K. Wone 
PRINCETON UNIVERSITY 
Communicated by J. von Neumann, December 11, 1953 


Consider a matrix A = (a;,;) of m rows and n columns with non-negative elements. 
Define 


k 
3 = > ay; (j,k = 1,2, ..., n). (1) 
i=1 
We let s; = s;” for short, and let || A|; = max (s;, ...,8,). In particular, for a row 
vector a’ with non-negative coordinates a), ..., Gn, ||@’|| = max; (a). If Bisa 
column vector with non-negative coordinates b;, ..., bm, we have ||@)) = oid. 





The definition for ||A|| has the properties of a ‘‘norm” in Banach space. We use 
A(., k) to denote the kth column of A. 

Two square matrices are said to be equivalent to each other if one can be trans- 
formed into the other by permutations of the same rows and columns. 

TueoreM. Let A = (aj;) be a square matrix of order n with non-negative elements 
such that s; S 1 forj = 1,2, ...,n. Then the following properties are equivalent to 


one another. (i) I — A is non-singular. (ii) The matrix A is equivalent to one satis- 
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fying x <1 fork = 1,2, ...,n. (ii) For some h < nand e > 0, the matriz A satis- 

fies ||o'A"| Se <1, rb of eaa(gy is AS OIGy) lim,|| A®|| 1/4 <1. (v) The matrix 
k-1 

A is equivalent to one satisfying >> san <1 — ay fork = 2,3, ...,n,and0<1 - 
i=1 

ay}. 


We shall show that the conditions (2), (77), (zz), (2v) imply one another in cyclic 
order, and (77) implies (v), (v) implies (7). (2) implies (i): If all the sums s; were 
equal to 1, then 1 — Yan = 0 for k = 1, ..., n, which would be the same as 
(1,1, ..., 1) — A) = 0. In other words, the n rows of I — A would be linearly 
dependent, and J — A would be singular. Thus, condition (2) implies that one 
sum, say s,, is less than 1. Nowd, = det (I — A) > 0 implies that d, > 0, where 
d,. is the determinant of the submatrix J — A, consisting of the first k rows and 
columns of J — A (ref. 3, p. 237, Corollary 2). Thus d, > 0 implies the existence 
of a column in A, such that the sum of its elements is less than 1. Let such a 
column be the kth one of A;, where k = n — 1,n — 2,...,1. This proves that (7) 
implies (72). (¢) implies (iii): If every element below the principal diagonal, a, 
n j 

with i > k, are zeros, then by (ii), we have s; = )oaij = Doiay < 1 forj = 1, 2, 
1 1 

,n. This implies that | A|| < 1 and hence the conclusion. Consider now a; for 


? > k are not all zeros. If on the (n — 1)th column, a,, ,) = Hs then s‘”, = 
s">? < 1 by (i). We, thus, have two columns A(-, n — 1), A(-, ») with sums 


8,1<1,8,< 1. If ay, ,1 > 0, then by (2), s < 1 and — <i g e 
dled that o'A(-,n — 1) = Yosay, na < Dai, n-1 S 1 by hypothesis. In 
either case, we have o’A(-,n. — 1) < 1. If either a,, ,-2 or @,_1, ,—2 iS not zero, 
then o’A(-,n — 2) < lor o’A*(-,n — 2) < 1, respectively. Hence, o’A*(-, 7) < 1 
forj=n—-2,n-—I1,n. C cntemnng this type of reasoning, we can determine 
an integer h S n — 1 such that 0’A*(-, 7) < 1 for allj = 1, 2, ...,. Conse- 
quently, there exists an e such that 0 < e < 1 and ||o’A"|| < e. This proves that 
(ii) implies (iii). (diz) implies (iv): Let ay = log ||o’A?—) |A?||. Then 
a = Oanda,,, S a, + a,. Pélya and Szegé (ref. 2, Problem 98) show that lim, 








(a,/p) = inf, (a,/p). By hypothesis, we have a, < 0if p— 12 h. Hence a,/p < 
0 for all p = h + 1, and b = inf, (a,/p) <0. It follows that lim, ||A?||'/? = e < 1. 
In fact, e’ is 0 or 0 < e? < 1 according tob = —” orb > —o. This proves that 
(ii?) implies (7v). That (iv) implies (7) is well known in Banach algebras. Evi- 
dently, (v) follows from (77). Finally, that (v) implies (7) follows from inequality 
(9) in (ref. 4, p. 139). This completes the proof of the theorem. 

Another obvious equivalent condition is that all the proper values of A lie in a 
unit circle with center at the origin. 

A matrix A is said to be quasi-nilpotent if lim ||A%|'"7 = 0. For a finite square 
matrix, quasi-nilpotency implies nilpotency and anaes 

Coro.uary. Let A be a square non-negative matrix of order n such that s; S 1 
for 7 = 1,2, ...,n. If A is not nilpotent, and has a simple proper value, 4, with the 
greatest modulus, then 


0< A, = lim, | Aq!" = e, b = inf, [p~log || A7||] > — @. 


For, by Frobenius’ result,' \; is non-negative. Since A is not nilpotent, A; > 0 
As }, is simple by hypothesis, it follows, from Syvester’s spectral representation of 
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a matrix, that, for sufficiently large p, 


i ee k)|} or 
ee Ee 

where k = 1, 2,..., 7. We can show that (A; — €)|| A?|| S$ ||A't) S (+ €)|| A?||. 
This means that A, = lim ||A?+'\|/||A?|_ = lim ||A?|/!/?, which proves our result. 

Frobenius shows that an indecomposable, non-negative matrix has a simple, 
positive maximum proper value. Thus, the Theorem of Vivanti and Pringsheim 
on power series with non-negative coefficients may be extended to a power series 
>A’ with the powers of an indecomposable non-negative matrix A. 

Let A be a square matrix of order n with non-negative elements. Then ¢ (real or 
complex) is in the resolvent set of A if 1 — A is regular. Let B = |t|~'A; we 
may apply the preceding Theorem to — B. The convergence of >>| ¢|~*~'A* im- 
plies that of }(¢-*-'A*. For simplicity, we consider? > 0. If A and B are square 
matrices with non-negative elements and satisfy the second Hilbert resolvent equa- 
tion with respect to ¢, then every pair of matrices of the form aA + (1 — a@)B, 
where 0 < a@ S 1, satisfies also the second resolvent equation with respect to ¢. 

k 

The proof makes use of property (i) in the above Theorem by verifying ¢~! > 

i=1 

(wax + (1 — ajax) << 1for0 < a <1. This proves the convexity property of 
the set of non-negative matrices satisfying the second resolvent equation. 


Applications.—We assume the regularity of J — A. If ||A|) < 1, then 
s, n 8; ) 
= Ax(t,7) S ; (2 
i— lal, = 9) ST Tq] 


where Ax = A+ A? +... + A? + ..., and |/Ali,; = min (8, ..., 8,). 

The preceding inequalities are not useful in applications, as |A|) = 1 arises fre- 
quently in practical problems. To overcome the difficulty arising from the unit 
modulus, we apply property (777) in the above Theorem to determine an integer 
h<nsuch that ||o’A"|| < 1, where o’ = (8, ..., 8) = (1,1,...,1)A = ¢'A. We 
ean verify that 

" h e h 
oI +A + see + A") St'h < o~—+A+t apa A ) (3) 
1 — |lo'A"|, 1 — |\o'A"| 





Inequalities (3) are very significant for computation, as h is 1 or 2 in practical prob- 
lems. When h = 0, the above two inequalities reduces to (2). If || A| < 1, then 
(3) holds for every h, and hence they may serve as refinements of (2). 

Let 8’ = (b,, ..., b,) be a vector with non-negative coordinates. Assuming the 
existence of (J — A)~!, we have (I — A)~!' = 1 + Ax. With the aid of (2) or (3), 
we estimate the elements of B’As and hence of B’(J — A)! = B’(1 + Ax). The 
approximations for B’A* are applicable to some expressions involving the resolvent 
of A by noting that «(7 — A)~! = I + (t-'A)x fort > 0. Moreover, inequality 
relations concerning the elements on each row of the resolvent (42 — A)~'!, t > 0, 
can be determined. Since Ax — A = AxA = AAx, we have 


B'Aw = B'( + As)A = B'A + B’AAs S |l8'llo’ + llp’Alle’As, (4) 
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B’As = B’A(E + Aa) S |[B’Allg’ + Aw) = ||6°All(’ + oe), — () 
B’Ax = ||B’All Ag’ + ¢'Ax), (6) 
= 


B’Ax = ||6’Allr||A|| 0’ + |[B’ Al] ¢’As, (7) 
where |/8’A r = min (6’A(.,1),...,8’A(.,n)). We obtain (7) from (6), as | Al| Ig! 
< ¢’. Thus, (4) and (7) furnish an upper and a lower bound of similar nature. 
Since || ||, is usually very small, the value ||8"A|| || All, would be too large to render 
B'Al|-|Al|,~o’ + ||B°Al|¢’d* a useful approximation for an upper bound. If 
we replace the moduli in (4) by the lower moduli, we would not secure a good lower 
bound in comparison with (6), since ||'||, = 0 in practical problems. We may sub- 
stitute (3) into (4)-(7). In ease |All < 1, we obtain by virtue of (2) 


a'All. (2 + 0 =r) B’Ax  ||B'Al| (: + a rai) (8) 


: . |a"All 
a'alle(ay +a) © 8's 5 (lott + HT) o (9) 
+4} — Alle ie 





The inequalities in (8) follow from (5) and (6), while those in (9) follow from (4) 
and (7). The inequalities (8) and the upper bound in (9) may be simplified as fol- 
lows: 

| | 
sermon eehgte 


~ t= ilAll 


|| || 


B’All, 
of £8 As & 
“oe |All,” GAs ie 








Observe that the lower and upper bounds for ¢'A* given by (2) usually render 
poor estimates unless ||A|| is sufficiently small. _ If 


|All <1 — |[6'l|/e, ¢ = |[6’ + o'l, (11) 


then ||@’|/(1 — ||A||) < e¢, and the last inequality in (10) gives a better estimate 
than B’Ax < co’, which is the same as inequality (30) in reference 3, page 239. In 
practical problems, ¢ usually is at most 1. If (11) holds, thene <1. For, if ¢ = 1, 
it would imply that ¢ < ||A/| + eal < 1, contradicting toc = 1. 

The inequalities (4)-(7) together with (3) are simple for practical problems. 
These inequalities will improve the lower and upper bounds for determinants in 








reference 3. 


* The paper was prepared under the Navy contract N6onr-27009. 

| Frobenius, G., “Uber Matrizen aus nicht-negativen Elementen,” Sitzber. kgl. preuss. Akad. 
Wiss. zu Berlin, pp. 456-477 (1912). 

2 Pélya and Szegé, Aufgaben und Lehrsdtze aus der Analysis, Vol. I, J. Springer, Berlin (1925). 

3 Wong, Y. K., “Some Inequalities of Determinants of Minkowski Type,” Duke Math. J., 19, 
231-241 (1952). 

* Wong, Y. K., ‘An Inequality for Minkowski Matrices,’ Proc. Am. Math, Soc., 4, 137-141 
(1953). 
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SOME RELATIONS FOR CRYSTALS WITH SUBSTRUCTURES 
By M. J. BUERGER 


CRYSTALLOGRAPHIC LABORATORY, MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
Communicated December 18, 1953 


Introduction.—In many a complex crystal structure there can be recognized a 
set of atoms arranged in a pattern having simpler geometrical characteristics than 
the entire pattern. Such a set may be defined as a substructure. Familiar examples 
of this relation are found in crystals whose structures are based upon close packing 
of the larger atoms. Thus in kyanite, Al,SiO;, the oxygen atoms can be described 
as being arranged in cubic close packing. These atoms, considered alone, consti- 
tute a substructure which, ideally, has a smaller cell and higher symmetry than the 
complete structure of kyanite. The larger cell and reduced symmetry of the 
structure as a whole is due to the way the metal atoms are distributed among the 
interstices of the oxygen atoms. From a symmetry point of view, a crystal strue- 
ture and its substructure (or substructures) conform to derivative structure theory :! 
the substructure is a basic structure while the entire crystal structure is a derivative 
structure, and has a derivative symmetry. 

The presence of a substructure gives rise to specialized x-ray diffraction effects. 
Under favorable circumstances these effects are obvious and the existence of the 
substructure is accordingly revealed. In these cases, both the Patterson map and 
reciprocal lattice itself have specialized features which may be recognized. 

Properties of the Vector Set of a Structure Containing a Sub structure.—Let a set of 
points be decomposed into a subset and its complement subset. Then the vector 
set? of the entire set contains three vector subsets: the vector set of the subset, 
the vector set of the complement subset, and the cross products of the subset and 
complement subset. Since the substructure corresponds to a subset, it follows 
that the Patterson of a crystal with a substructure contains the Patterson of the 
substructure. 

Now it is an elementary consequence of derivative-structure theory that the 
density of coincidence operations in the basic structure is a multiple of the density 
of coincidence operations in the derivative structure. Thus the weighting of the 
substructure peaks is a multiple of the weighting of the other peaks of the vector 
set, and therefore, other things being equal, the substructure peaks tend to be the 
most prominent features of the Patterson. Furthermore, in the commonest case, 
the primitive cell of the entire structure is a multiple, n, of the primitive cell of the 
substructure. The cells of the vector sets of these structures, of course, bear the 
same relation to one another.? This means that while one origin peak of the sub- 
structure coincides with the Patterson origin of the entire structure, the (n — 1) 
intense origin peaks of the Patterson of the substructure occur within the cell of the 
Patterson of the entire structure. Thus, for a crystal containing a substructure, 
the most prominent feature of the Patterson is usually a set of strong peaks out- 
lining the lattice of the substructure. 

Conversely, the existence of a set of approximately equal peaks outlining a sub- 
lattice in a Patterson is evidence suggesting the existence of a substructure having 
that sublattice. For example, if one observes within the Patterson cell a set of 
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prominent peaks corresponding to a cubic face-centered lattice of appropriate 
translation, he has reason to suspect a substructure of atoms (presumably large elec- 
tronegative atoms such as oxygen or sulfur) in cubic close packing. 

One can go further than this. Since the Patterson of the substructure has the 
periodicity of the subcell, the vectors pertaining to the substructure are repeated 
along with the origins of the subcell. In other words, when the crystal contains a 
substructure its Patterson has a recognizable subperiodicity within the whole cell. 
It is possible to study this sub-Patterson intuitively, or it can be treated more system- 
atically. For example, one can construct a minimum function® * for the n points 
of the sublattice, and let this function range the subcell. This minimum function 
lays down a value which approaches (i.e., is greater than or equal to) the Patterson 
of the substructure alone plus the cross terms, both of which have the periodicity 
of the subeell. This result can then be subtracted from the rest of the Patterson, 
leaving, as a residue, the Patterson of the complement structure alone. Since the 
substructure is usually easy to interpret, or even to guess, this may permit a com- 
plicated Patterson to be interpreted. 

The Reciprocal Lattice of a Structure Containing a Substructure.—While the Patter- 
son is probably the most powerful tool for investigating a crystal with a substructure, 
a good deal of information can be obtained from a mere inspection of a photograph 
of the reciprocal lattice® since, of course, the Patterson is derived from the arrange- 
ment and absolute weightings of the points of the reciprocal lattice. As pointed 
out above, the vector multiplicity in the substructure is comparatively high, and 
therefore the reciprocal terms contributing to the origins of the vector set of the 
substructure tend to be strong. This means that there should be heavily weighted 
points in the reciprocal lattice, especially near the origin, corresponding to the 
origin points of the substructure of the Patterson. Hence it is evident that al- 
though there is a small cell for the reciprocal lattice as a whole, yet certain points, 
especially near the origin, are prominent and mark out a super reciprocal cell. This 
supercell corresponds to the subcell (i.e., the cell of the substructure) in crystal 
space. 

Figures 1 and 2 outline the relations involved in the lattices, sublattices, and 
superlattices of direct and reciprocal space. Figure 1 shows the lattice and sub- 
lattice in direct space, while figure 2 shows the corresponding cell and supercell in 
the reciprocal space. For simplicity the illustrations are two-dimensional. 

In figure 1, the large cell A,Az is the cell of the crystal. Within it is a substructure 
whose cell a,a2 is a submultiple ('/i2 the area in the illustration) of the cell of the 
whole crystal. The subcells are delimited by planes (Ajk;) and (hok2). In the 
illustration these are taken as (31) and (14), respectively. 

In figure 2 the cells reciprocal to those of figure 1 are shown. The cell reciprocal 
to A,A, is the small cell A,*A.*. To find the cell reciprocal to the subcell aya, it 
is merely necessary to find the translations reciprocal to d,,;, and d),,,, specifically 
[Aik |* and [hok.]*. Thus the cell in reciprocal space defined by the reciprocal 
lattice points [[hyk,]]* and [[hek2]]* constitutes a supercell corresponding to the 
substructure, and these points may be expected to be prominent in reciprocal space. 

Figures | and 2 are mutually reciprocal and can be interchanged; for example, 
figure 1 may be regarded as the reciprocal lattice picture from which the crystal 
lattice and its substructure of figure 2 can be derived. Note that one picture corre- 


a 


AAPL AGA see na, 


meres Leer 


CANAAN IE SRR BIE 0 


~~ 











al 
nd 
b- 


in 


ire 


he 
he 











oneteeNL ea RN TS 


re a Sons oe ee 


separ 
































FIGURES 1 (ABOVE) AND 2 


sponds to the projection of the crystal cell, the other to the photograph of the recip- 
rocal lattice. The large cell of one is similar to® the small cell of the other if rotated 
90° and vice versa. 

Conclusions.—When a crystal contains a substructure the fact can often be de- 
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tected not only from its Patterson but also often from the photographs of its recip- 
rocal lattice. The Patterson can be decomposed by means of a sublattice minimum 
function and manipulated so that the Patterson of the structure not concerned with 
the substructure is independently available for investigation. 

! Buerger, M. J., “‘Derivative Crystal Structures,” J. Chem. Phys., 15, 1-16 (1947). 
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5 Buerger, M. J., “Photography of the Reciprocal Lattice,’ Am. Soc. for X-Ray and Electron 
Diffraction, Monograph Number 1 (1944). 

6 Buerger, M. J., X-Ray Crystallography, John Wiley & Sons, Inc., New York, 1942, p. 116. 


ON DIFFRACTION BY A STRIP* 
By A. Erperyr anp C. H. Papas 
CALIFORNIA INSTITUTE OF TECHNOLOGY 

Communicated by P. S. Epstein, December 14, 1953 


Introduction.—The problem of diffraction by an infinite strip or slit has been 
the subject of several investigations.' There are at least two “exact”’ methods for 
attacking this problem. One of these is the integral equation method,’ the other 
the Fourier-Lamé method.’ The integral equation obtained for this problem 
cannot be solved in closed form: expansion of the solution in powers of the ratio 
(strip width/wavelength) leads to useful formulas for low frequencies. In the 
Fourier-Lamé method the wave equation is separated in coordinates of the elliptic 
cylinder, the solution appears as an infinite series of Mathieu functions, and the 
usefulness of the result is limited by the convergence of these infinite series, and by 
the available tabulation of Mathieu functions. 

The variational technique developed by Levine and Schwinger avoids some of the 
difficulties of the above-mentioned methods and, at least in principle, is capable of 
furnishing good approximations for all frequency-ranges. The scattered field may 
be represented as the effect of the current induced in the strip, and it has been 
proved by Levine and Schwinger‘ that it is possible to represent the amplitude of 
the far-zone scattered field in terms of the induced current in a form which is sta- 
tionary with respect to small variations of the current about the true current. 
Substitution, in this representation, of a rough approximation for the current may 
give a remarkably good approximation of the far-zone scattered field amplitude. 
In this note we assume a normally incident field polarized parallel to the generators 
of the strip. As a rough approximation, we take a uniform density of the current 
induced in the strip. Since the incident magnetic field is constant over the strip, 
Fock’s theory® may be cited in support of the uniformity of the current distribution, 
except near the edges where the behaviour of the field® indicates an infinite current 
density. A more detailed analysis of the current, by Moullin and Phillips,’ is avail- 
able but was not used here. 
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Once the (approximate) amplitude of the far-zone field has been obtained, the 
scattering cross-section may be found by the application of the scattering the- 
orem* 7 § which relates this cross-section to the imaginary part of the amplitude of 
the far-zone scattered field along the central line of the umbral region. In spite of 
the crude approximation adopted for the induced current, the scattering cross- 
section shows a fair agreement with other available results. 

Integral Equation.—We assume a plane wave with harmonic time dependence 
exp (—iwt), normally incident on the perfectly conducting infinitesimally thin 
strip of width 2a, 

z=0, —a<y<a. (1) 
We further assume that the incident wave is polarized parallel to the edges of the 
strip (i.e., to the x axis) so that the only nonvanishing components of its electric 
field E, magnetic field H, and complex Poynting vector S (all measured in MKS 
units, bars denoting conjugate complex numbers) are 


E, = exp (tkz) (2) 
H,’ = k/(wm) exp (tkz) (3) 
S.' = E,'H,' = k/(wp) (4) 


The scattered electric field is again parallel to the x axis, and the total electric 
field is the sum of the incident and seattered field, 


, 


E,(y, z) = E,‘(y, z) + E,*(y, 2) (5) 
The boundary condition on the screen is the vanishing of the electric field 

E,(y, 09) = 0 for -a<y<a; (6) 
and the scattered field, E,*°, must represent, at large distances from the screen, an 


outgoing cylindrical wave (Sommerfeld’s radiation condition). 
The expression of the total electric field in terms of the induced current K,(y) is 


E,(y, 2) = exp (ikz) — wp/4 S Ho®(kl(y — y’)? + 2) Ky’)dy’ (7) 


M4 9 9}1/o . . . . 2 . 
1/4H)(k[(y — y’)? + 27] ’*) being the two-dimensional free-space Green’s function. 
The scattered field in (7) certainly satisfies Sommerfeld’s radiation condition; in 
order that it also satisfies (6), we must have 


L = wp/4 J H(kly — y'|)K,(y’)dy’ yi) <a (8) 


and this is the integral equation of our problem. 
Far-Zone Scattered Field Amplitude.—We define the far-zone scattered field ampli- 
tude A(¢) by 
E, ~ i/4[2/(aikp) |" exp (ikp)A(o) kp—> (9) 


where p cos @¢ = z and psin@ = —y; this expression represents a cylindrical out- 
going wave of “amplitude” A. 
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Since 





Hy(k{(y — y’)? + 27]'”) ~ [2/(wikp)]'" exp (ikp + iky’ sin ¢) (10) 


when kp > , we have from (7), for large kp, 





E,°° ~ i/4[2/(aikp) |” exp (ikp) [ iwu exp (iky’ sin $)K,(y’)dy’ (11) 


— 









Comparing (9) and (11) we get 


A(o) = twu JS exp (iky’ sin 6) K,(y’)dy’ (12) 


=—¢ 




















A Scattering Theorem.—lf{ P denotes the scattered energy flux per unit length, 
then the scattering cross-section o is defined by 
a0 = P/S (13) 
where S = 1/2 Re S,' = k/(2wy) is the incident energy flux per unit area. 
a a 
P= 1/2Re JS E,'(y’, O)K,(y’)dy’ — 1/2 Re J K,(y’)dy’ (14) 


—@ 





since EF,’ = 1 at z 0. Hence, 


o = wu/k Re SJ K,(y’)dy’ (15) 


—a 


But from (12) 


A(O) = ion [ K,(y’)dy’ (16) 


Therefore, comparing (15) and (16) we get 
o = Im A(0)/k (17) 


It may be remarked that, in the limit for very high frequencies, the cross-section 
(17) turns out to be twice the projected area. At first sight the factor two may 
seem inconsistent with geometrical optics, but actually the discrepancy is due to the 
different definitions of the scattering cross section.® 

Variational Principle.—We multiply the integral equation (8) by K,(y), integrate 

a 
from y = —atoy = +a, divide by the square of /” K,(y)dy, and then recall (16). 
—=¢@ 


Thus we obtain 


| y 4 Kandy | 


A(0) = 44 ——— . : 
SS KAy Ho (ky — y'\)Ki(y’)dy dy’ 


=i —=@ 


(18) 





It is easy to show that the expression (18) is stationary with respect to small 
variations of K, about the true K, which satisfies the integral equation (8). Con- 
sequently, if we substitute a reasonable trial function for the unknown K, in (18), 
we expect to obtain a reasonably accurate value of A(0). 
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Trial Function.—According to Fock’s theory® the current induced in the central 
region of the strip is equal to twice the tangential component of the incident mag- 
netic field. And according to Bouwkamp,’ H, must have a singularity like 








(a? — y*)~-"? at the edges. This certainly conforms to the detailed information 
Moullin and Phillips* reported. In the present paper we take for a trial function 
K,(y) = 1, thus ignoring the effect of the edges. 
1.5 
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FIGURE 1 


Plot of o«/(4a) versus ka. Solid curve is variational solution. Broken curve is Morse 
and Rubinstein’s. 


Scattering Cross Section.—With K,(y) = 1, computation shows that (18) becomes 


2ka 
ee H)(jdt — H,(2ka) + (wika)— (19) 


0 


[A(0)]-! = (4%ak)— 


And applying the theorem (17) to (19), we get 

2ka 

JS. Jo(t)dt — Jy(2ka) 
a/(4a) = = 


2ka 2 2ka 2 
| Ji(t)dt — J.(2ke) | + |S Y,(Q)dt — Y,(2ka) — (aka) 1] 
0 0 
(20) 


The integrals appearing here have been tabulated." 

Results—A plot of o/(4a) versus ka is shown in the graph. This is a plot of 
equation (20). For small ka we have Rayleigh scattering. As ka increases the 
curve performs a damped oscillation about ¢/(4a) = 1. And for ka— o it can be 
shown by means of asymptotic representations of Bessel functions and their inte- 
grals that o —> 4a. 

According to Babinet’s principle'! the problem we have discussed is comple- 
mentary to the problem of scattering by a slit for a normally incident wave polar- 











132 PHYSICS: J. SCHWINGER Proc. N. A. 8: 


ized perpendicular to the axis of the slit. For intermediate values of ka our curve 
behaves qualitatively as Morse and Rubinstein’s:' the quantitative agreement is 
less good, the deviation being due to our choice of an overly simplified trial function. 
It is quite remarkable how such a rough approximation of the induced current 
yields fairly good results over the entire spectrum. 

* Most of the work discussed in this note was supported by the Office of Naval Research. 
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THE QUANTUM CORRECTION IN THE RADIATION BY ENERGETIC 
ACCELERATED ELECTRONS 
By JULIAN SCHWINGER 
HARVARD UNIVERSITY 
Communicated December 30, 1953 
The radiation emitted by high energy electrons in a large scale magnetic field is 
an essentially classical phenomenon, and was so discussed in a paper! of the author. 
It was there remarked that the condition for quantum effects to be unimportant is 


that the momenta of the radiated quanta be small compared with the electron 
momentum. At very high electron energies, this requirement reads 


h<<E, © ~ w(E/me?), (1) 
where 


w) = c/R = (eH/mc)(mc?/E) (2) 


is the rotational frequency in the orbit determined by the magnetic field H. Hence 
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; the magnitude of the quantum correction should be indicated by 


=( E ) = h/ ml E ) 
. E \me? R \me? 


(eh/mc)H E : 
 aeanieen ane” (3) 


me? mec? 





; which is extremely small for all existing or contemplated magnetic devices. Re- 
cently, there have been several attempts at a quantum mechanical calculation? or 
diseussion® of this emission process, which have led to varying degrees of doubt 
concerning the validity of (3) as a measure of the quantum correction. To aid in 
settling this question, we shall present an explicit calculation of the quantum 
e correction, to the first order in h. 

At this level of accuracy, the spin degrees of freedom play no role for unpolarized 
particles. With a suitable canonical transformation, the Dirac Hamiltonian can be | 


replaced by the classical 
i= |e(p _ “a) +- ome’) (4) 
c 


for a field described by a vector potential (this includes the magnetic field and the 
radiation field). The probability for emitting a photon, during the time interval 
T = t, — ty, into the range (dk) about the propagation vector k and with polarization 
vector e@, is obtained as the absolute squared matrix element of the operator 


Ik) 2] 
E eq 0 fate (e-v(), ec"), (5) 
oT @ 2 


where v(t) is the particle velocity, and( =, __) indicates the symmetrized product : 








i of the operators. This matrix element refers to specific initial and final particle 
4 states. On summing over all final particle states, we obtain the probability for the 
5 radiation process as an expectation value computed for the initial particle state, | 
FA | 
(dk) 2c" — iw(t — t’) ek: r(t) -ik-r(t’) n | 
3 — “ate (e- v(t )(e- v(t’), e )» (6) 
3 (27)8 a. te 
: 
: If we are not concerned with the radiation of a specific polarization, one should ) 


s replace e- v(t)e- v(t’) with nXv(t)-nXv(t’) = v(t)- v(t’) — n-v(é)n- v(t’), where n 

t is the unit vector directed along k. The relation 

s 

n (7k- v(t), ef) = (d/dt)e**® (7) } 


then enables us to write the probability for radiation, without regard to polarization, 
as 


(dk) c* | , 
(2n)* 7 fra fare aes (v(t), e ).(v(!), emit) 
“oT7)* a) e 


pik T(t), — te -r(e ). (8) 
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For a sufficiently extended time interval 7’, we can introduce new time variables 
t—>t + '/or, t’ > t — '/or, and regard the ¢ integrand as the probability per unit 
time of the event. On multiplication with fw we find that the average power 
radiated into a unit solid angle enclosing n, and contained in a unit frequency 
interval about w, is 


Po a. 1\. ms 
P(n, w, t) = of dre “*" vit + —r), Pt ).( ye — =r), 
dg? ¢ J m ¢? 2 2 
aad —_ ei ttt 27) ee (9) 


If the order of operator multiplication is not significant, this result is equivalent to 
the classical formula (1.38) of reference 1. 

We shall now investigate the commutation properties of the operators r(¢) and 
v(t), to the first order inf. Thus, the equation 


(v, H) = (p 2 “a)e (10) 
c 


is solved, to this accuracy, by 
from which we derive 

c 

Hf 


y= (a, (p =e “A)e*), (11) 
0 2 
[7;, v; | = th : = th ap ome ce voi (12) 


Consistent with the approximation program, we have on the right replaced the 
operator H by the classical determinate energy EL, and disregarded the non-com- 
mutativity of v; and v;. The latter is described, to the same approximation, by 


l oC l 
sexe = i] 1 — oop + Sw | (13) 
c yy? c? 


The classical motion occurs in the plane perpendicular to the magnetic field, 
v‘H = 0, so that (13) implies the following uncertainty principle for the velocity 
components in this plane, 


| 1. ecH 
, AAr > =~ (1 — 0°/c?), (14) 
Cc 2 
from which we infer 
l , cH h . 
s{(A0)? + (Av2)?] > #1 = v8/et) = PL = v8/e2). (15) 
c y? y 


Since the particle speed occurs significantly only in the combination 1 — v?/c?, the 
effect of the non-simultaneous specifiability of v; and vp is measured by the co- 
efficient fia,/H = ehcH/E?, which is negligible compared to (3), and decreases 
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with increasing energy. Hence, at sufficiently high energies, the components of v 
are determinate quantities. 


The symmetrized products of v and e*"** can be replaced by 
he? l Nei 
(v, &) =|v— —(k-— viv] eit (16) 
* 2E c? 
and 
hic? | Ss 
(v, ee) = eT) y — —(k — — vk-v) |. (17) 
2E c? 
- - P re ‘kk. 1/o an ditcettan2ife r 
We then combine the contiguous quantities, e"!t’”” and e~™™*~'””?, with the 
£ | 
aid of the theorem 
oAeB = pAtB,VilABL (18) 


which is exact if [A, B] commutes with A and B, and suffices for the purposes of the 
approximation we are employing. Since the values of + that contribute to the 
integral (9) are such that wr(1 — v?/c?) ~ wor(1 — v?/c?) ~ 2? ~ 1, we have wr<<l, 
and one can use the expansion 


l l l | (ae 
— 2 3 ( 
r( oa 57) = rt) + 5 THU) o- 3 r’v(t) + Td v(t). (19) 
Thus 
(0+: ) ( 3 ) () + HE 
r —f —_- = ‘ 
9 . 5 T TV D4 T°V(l) 
I 
= (; ~e wt) v(t), (20) 
and 


| l | ‘<a 
EC + - r), k-(1 = *)| = —7(kK.r, k.v| — ry (k.r, k.v] 
= i(k: ° YI (> _ : wr )iev] 
op/L\ 24 
a >, tw l ys a 
= — 27 ow | ——me T+ 5 ur , (21) 


in which we have omitted commutators containing only velocity components, and 
introduced simplifications appropriate to the nature of the angular distribution, 


I 
n-v ~ 1. We should also note that the velocity factors multiplying the exponen- 
‘ Z : 


tials in (9) combine into 


l | ia 
(r( a r)-v(' -_ r)- i)() ” ae (v( 7” : r)+ (! * 7)) (22) 


where the second factor can be approximated by | + (fiw/H). The resulting form 
of (9) is 
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P(n, w, t) = - “(i wt we) f ar (t+ dis r): v(! a 7) - 1) 
tor? ¢ E — "rs 2 2 
“exp | -ie (1 a Ee Y(( - 'nv)t + wrt) (23) 
E c 24 


We have thus obtained the simple result that first order quantum effects are 
included by making the substitution 


he 
wo>a(l+ — (24) 
E 
in the classical formula for w~!P(n, w, t). The same statement applies to w~'P(a, é), 
since the correction factor does not depend upon the emission direction. On re- 
ferring to the classical formula (II.16) of reference 1, we see that the power radiated 
per unit, frequency range about w is given by 


ae * wow 
P(w, t) (=) - ah dnK s(n), (25) 
~ de R\me? 6? SJ w/we(1 + (hw/E)) 


where 


On wal h/me*)®. 


Hence, the first order quantum correction is 


3” 0?/ EF \iwo/w \3 ho, 
to = eB m2) fut 
(w, ape we\w,/ EH Kej(wo/er) 


and the corresponding correction to the total power emerges as 


55 Cf BE \*ha, 
6P(t) = —3""- , WO : =e 


me?) E 


36; Ue 


| dé BKs/() =* ae (29) 


It is thus found that, with regard to the total radiated power, the effect of first 


with the aid of the integral 


9 
order quantum corrections is to multiply the classical value, gwo(e?/ BR) ('/me*)*, 


Dd h/me/ E ) 
> 30 
16 R = a 


' Schwinger, J., Phys. Rev., '75, 1912 (1949). 

2 Parzen, G., [bid., 84, 235 (1951) (corrected by Judd, Lepore, Ruderman, and Wolff, /bid., 86, 
123 (1952), and Olsen, H., and Wergeland, H., [bid., 86, 123 (1952)). Neuman, M., /bid., 90, 682 
(1953). 

3 Schiff, L. I., Am. J. Phys., 20, 474 (1952). 

* While writing this paper, the author was delighted to encounter, in the December, 1953, issue 
of Physics Abstracts, a report of a paper by Sokolov, A. A., Klepikov, N. P., and Ternoy, I. M., 
Doklady Akad. Nauk SSSR, 89, 665 (1953), in which the identical result is quoted as obtained by a 
calculation based on the Dirac equation. 
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